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(57) Abstract: Isolated polynucleotides are provided. Each of the isolated polynucleotides comprise a nucleic acid sequence encod- 
ing a polypeptide having an amino acid sequence at least 80 % homologous to SEQ ID NO: 26, 106, 107, 109, 110, 112, 114, 115, 
118, 119, 122, 123, 124, 126, 95 or 96, wherein the polypeptide is capable of regulating cotton fiber development. Also provided 
are methods of using such polynucleotides for improving fiber quality and/or yield of a fiber producing plant, as well as methods of 
using such polynucleotides for producing plants having increased biomass/vigor/yield. 
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POLYNUCLEOTIDES AND POLYPEPTIDES INVOLVED IN PLANT FIBER 
DEVELOPMENT AND METHODS OF USING SAME 

5 

FIELD AND BACKGROUND OF THE INVENTION 

The present invention relates to polynucleotides and polypeptides involved in 
plant-fiber development and methods of using same. 

The present invention relates to a novel computational approach that utilizes 

10 comparative genomics to identify genes which play a role in fiber development. 

Cotton and cotton by-products provide raw materials that are used to produce 
a wealth of consumer-based products in addition to textiles including cotton 
foodstuffs, livestock feed, fertilizer and paper. The production, marketing, 
consumption and trade of cotton-based products generate an excess of $100 billion 

15 annually in the U.S. alone, making cotton the number one value-added crop. 

It is estimated that the use of cotton as a fiber by humans dates back 7000 
years in Central America and 5000 years in India. Even with the growth of synthetic 
fibers in the last 50 years, cotton still accounts for approximately 50 % of the world's 
textile fiber [Agrow Reports, Global Seed markets DS208, October 2000]. 

20 Even though 90 % of cotton's value as a crop resides in the fiber (lint), yield 

and fiber quality has declined, especially over the last decade [Meredith (2000), Proc. 
World Cotton Research Conference II, Athens, Greece pp.97-101]. This decline has 
been attributed to general erosion in genetic diversity of cotton varieties, and an 
increased vulnerability of the crop to environmental conditions [Bowman et al., Crop 

25 Sci. 36:577-581 (1996); Meredith, supra]. 

There are many varieties of cotton plant, from which cotton fibers with a range 
of characteristics can be obtained and used for various applications. Cotton fibers 
may be characterized according to a variety of properties, some of which are 
considered highly desirable within the textile industry for the production of 

30 increasingly high quality products and optimal exploitation of modem spinning 
technologies. Commercially desirable properties include length, length uniformity, 
fineness, maturity ratio, decreased fuzz fiber production, micronaire, bundle strength, 
and single fiber strength. Much effort has been put into the improvement of the 
. characteristics of cotton fibers mainly focusing on fiber length and fiber fineness. In 

35 particular, there is a great demand for cotton fibers of specific lengths. 
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Methods for improving the characteristics or yield of cotton fibers can be 
classified into the following three categories: 

1 . Variety improvement by cross breeding 

This method has been utilized most widely so far. At present, almost all the 
cultivated varieties of cotton plant are bred by this method. However, improvement 
of cotton fiber yield using traditional breeding is relatively slow and inefficientand the 
degree of variability which can be achieved is limited. 

2. Treatment with plant hormones 

Plant hormones such as auxin, gibberellin, cytokinin and ethylene have been 
widely used in field crops or horticultural products. The influence of plant hormones, 
particularly gibberellin, auxin and brassinolide, on the fiber characteristics of cotton 
plants is known [e.g. U.S. Pat. No. 5880110 produces cotton fibers with improved 
fiber characteristics by treatment with brassinosteroids]. However, no measurable 
effect has been documented, making practical use of these hormones on a large scale 
highly unlikely. 

3. Variety improvement by genetic engineering: 

The broad acceptance of genetically engineered cotton in the leading 
producing countries and the fact that it is a non-food crop make it an attractive 
candidate for genetic engineering for improvement of fiber yield and/or quality. 

In recent years, remarkable progress has been made in plant genetic 
engineering, as a result several cases of successful variety improvement of 
commercially important crop plants have been reported (e.g., cotton, soybean, com, 
canola, tomato). For example, methods of improving insect resistance by the 
introduction of a gene coding for BT toxin (i.e., insecticidal protein toxin produced by 
Bacillus thuringiensis) in a cotton plant, have been developed and put to practical use. 
In addition, cotton plants with improved herbicide (Glyphosate) resistance have been 
genetically engineered by the introduction of a gene coding for 5-enol-pyruvil- 
shikimic acid 3-phosphate synthetase. 

The availability and success of plant genetic engineering combined with the 
fact that cotton is an excellent candidate for genetic manipulation via recombinant 
techniques have led researchers to postulate that if a gene associated with an 
improved cotton fiber property could be identified, it could be up-regulated using 
recombinant techniques thus improving the characteristics or yield of cotton fibers* 
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w Conversely, if a gene associated with a decline in a cotton fiber property could be 

identified, it could be down-regulated using gene silencing methods. For this 
purpose, the mechanisms of fiber elongation and formation must be elucidated on the 
genetic level and genes closely associated with these mechanisms must be identified. 
5 A cotton fiber is composed of a single cell that has differentiated from an 

epidermal cell of the seed coat, developing through four stages, i.e., initiation, 
elongation, secondary cell wall thickening and maturation stages. More specifically, 
the elongation of a cotton fiber commences in the epidermal cell of the ovule 
immediately following flowering, after which the cotton fiber rapidly elongates for 
10 approximately 21 days. Fiber elongation is then terminated, and a secondary cell wall 
is formed and grown through maturation to become a mature cotton fiber. 

Several candidate genes have been isolated which are associated with the 
elongation and formation of cotton fibers. For example, five genes from cotton plants 
have been identified that are specifically expressed at the cotton fiber elongation stage 
15 by differential screening method and differential display method, [U.S. Pat. No. 
5,880,100 and U.S. patent applications Ser. Nos. 08/580,545, 08/867,484 and 
09/262,653]. 

WO0245485 describes methods and means to modulate fiber quality in fiber- 
producing plants, such as cotton, by modulating sucrose synthase (a sugar important 
20 for cell wall synthesis) activity and/or expression in such plants. 

U.S. Pat. No. 6,472,588 and WO01 17333 provide methods for increasing the 
quality of cotton fiber produced from a cotton plant by transformation with a DNA 
encoding sucrose phosphate synthase. The fiber qualities include strength, length, 
fiber maturity ratio, immature fiber content, fiber uniformity and micronaire. 
25 WQ9508914 discloses a fiber producing plant comprising in its genome a 

heterologous genetic construct. The genetic construct comprises a fiber-specific 
promoter and a coding sequence encoding a plant peroxidase, such as a cotton 
peroxidase. 

W09626639 provides methods whereby an ovary specific promoter sequence 
30 is utilized to express plant growth modifying hormones in cotton ovule tissue. The 
methods permit the modification of the characteristics of boll set in cotton plants and 
provide a mechanism for altering fiber quality characteristics such as fiber dimension 
and strength. 
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U.S. Pat. No. 5,981,834, U.S. Pat. No. 5,597,718, U.S. Pat. No. 5,620,882, 
U.S. Pat. No. 5,521,708 and U.S. Pat. No. 5,495,070 all disclose a method for 
genetically engineering a fiber-producing plant and the identification of cDNA clones 
useful for identifying fiber genes in cotton. The cDNA clones are useful in developing 
5 corresponding genomic clones from fiber producing plants to enable genetic 
engineering of cotton and other plants using these genes. Coding sequences from 
these isolated genes are used in sense or antisense orientation to alter the fiber 
characteristics of transgenic fiber producing plants. 

U.S. patent applications U.S. 2002049999 and U.S. 2003074697 both disclose 
10 cotton plants of the genus Gossypium with improved cotton fiber characteristics. The 
cotton plant has an expression cassette containing a gene coding for an enzyme 
selected from the group consisting of endoxyloglucan transferase, catalase and 
peroxidase so that the gene is expressed in cotton fiber cells to improve the cotton 
fiber characteristics. 

15 WO 01/40250 provides methods for improving cotton fiber quality by 

modulating transcription factor gene expression. 

WO 96/40924 provides novel DNA constructs which may be used as 
molecular probes or alternatively inserted into a plant host to provide for modification 
of transcription of a DNA sequence of interest during various stages of cotton fiber 

20 development The DNA constructs comprise a cotton fiber transcriptional initiation 
regulatory region associated with a gene, which is expressed in cotton fiber. Also 
provided is a novel cotton having a cotton fiber which has a natural color. The color 
was achieved by the introduction and expression in cotton fiber cell of a pigment gene 
construct. 

25 EP0834566 provides a gene which controls the fiber formation mechanism in 

cotton plant and which can be used for industrially useful improvement. 

However, beside Sucrose Synthase, there is no evidence to date that the 
, expression of any particular gene plays, an essential role in cotton fiber formation or 
enhanced fiber characteristics. 
30 Thus, there remains a need for identifying other genes associated with fiber 

characteristics of cotton plants and a more thorough search for quality-related genes is 
required. 
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While reducing the present invention to practice the present inventors devised 
and employed a novel computational approach that utilizes comparative genomics to 
identify genes which play a pivotal role in fiber development. As demonstrated 
herein, expression of such genes correlates with fiber length and their overexpression 
5 is sufficient to modify tomato seed hair, an ultimate model for cotton fibers. These 
results suggest that polynucleotides of the present invention can be used for 
generating transgenic cotton plants which are characterized by fibers of desired 
length. 

10 SUMMARY OF THE INVENTION 

According to one aspect of the present invention there is provided an isolated 
polynucleotide comprising a nucleic acid sequence encoding a polypeptide having an 
amino acid sequence at least 80 % homologous to SEQ ID NO: 26, 106, 107, 109, 
110, 112, 114, 115, 118, 119, 122, 123, 124, 126, 95 or 96, wherein the polypeptide is 
capable of regulating cotton fiber development. 

According to further features in preferred embodiments of the invention 
described below, the nucleic acid sequence is selected from the group consisting of 
SEQ ID NOs. 1, 2, 4, 5, 7, 9, 10, 16, 17, 20, 21, 22, 24, 25, 27 and 13. 

According to still further features in the described preferred embodiments the 
polypeptide is as set forth in SEQ ID NO. 26, 106, 107, 109, 110, 112, 114, 115, 118, 
1 19, 122, 123, 124, 126, 95 or 96. 

According to still further features in the described preferred embodiments the 
amino acid sequence is as set forth in SEQ ID NO. 26, 106, 107, 109, 110, 112, 114, 
115, 118, 119, 122, 123, 124, 126, 95 or 96. 

According to still further features in the described preferred embodiments the 
cotton fiber development comprises fiber formation. 

According to still further features in the described preferred embodiments the 
cotton fiber development comprises fiber elongation. 

According to another aspect of the present invention there is provided an 
isolated polynucleotide comprising a nucleic acid sequence at least 80 % identical to 
SEQ ID NO: 85 or 91, wherein the nucleic acid sequence is capable of regulating 
expression of at least one polynucleotide sequence operably linked thereto in an ovule 
endothelial cell. 
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According to still further features in the described preferred embodiments the 
ovule endothelial cell is of a plant fiber or a trichome. 

According to yet another aspect of the present invention there is provided an 
oligonucleotide capable of specifically hybridizing to the isolated polynucleotide. 

According to another aspect of the present invention there is provided a 
nucleic acid construct comprising the isolated polynucleotide. 

According to still further features in the described preferred embodiments the 
nucleic acid construct further comprising at least one cis-acting regulatory element 
operably linked to the isolated polynucleotide. 

According to still further features in the described preferred embodiments the 
polynucleotide sequence is selected from the group consisting of SEQ ID NOs: 1, 2, 
4, 5, 7, 9, 10, 16, 17, 20, 21, 22, 24, 25, 27 and 13. 

According to still further features in the described preferred embodiments the 
cis-acting regulatory element is as set forth in SEQ ED NO: 74, 75, 85 or 91 or 
functional equivalents thereof. 

According to an additional aspect of the present invention there is provided a 
transgenic cell comprising the nucleic acid construct. 

According to yet an additional aspect of the present invention there is provided 
a transgenic plant comprising the nucleic acid construct. 

According to yet another aspect of the present invention there is provided a 
method of improving fiber quality and/or yield of a fiber producing plant, the method 
comprising regulating an expression level or activity of at least one polynucleotide 
encoding a polypeptide having an amino acid sequence at least 80 % homologous to 
SEQ ID NO: 26, 106, 107, 109, 110, 112, 114, 115, 118, 119, 122, 123, 124, 126, 95 
or 96 in the fiber producing plant, thereby improving the quality and/or yield of the 
fiber producing plant. 

According to still further features in the described preferred embodiments the 
quality of the fiber producing plant comprises at least one parameter selected from the 
group consisting of fiber length, fiber strength, fiber weight per unit length, maturity 
ratio, uniformity and micronaire. 

According to still further features in the described preferred embodiments the 
regulating expression or activity of the at least one polynucleotide is up-regulating. 
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According to still further features in the described preferred embodiments the 
up-regulating is effected by introducing into the cotton the nucleic acid construct. 

According to still further features in the described preferred embodiments the 
regulating expression or activity of the at least one polynucleotide is down-regulating. 

According to still further features in the described preferred embodiments the 
down-regulating is effected by gene silencing. 

According to still further features in the described preferred embodiments the 
gene silencing is effected by introducing into the cotton the oligonucleotide. 

According to still further features in the described preferred embodiments the 
fiber producing plant is selected from the group consisting of cotton, silk cotton tree 
(Kapok, Ceiba pentandra), desert willow, creosote bush, winterfat, balsa, ramie, 
kenaf, hemp, roselle, jute, sisal abaca and flax. 

According to still an additional aspect of the present invention there is 
provided a method of increasing a biomass of a plant, the method comprising 
regulating an expression level or activity of at least one polynucleotide encoding a 
polypeptide having an amino acid sequence at least 80 % homologous to SEQ ID NO: 
26, 106, 107, 109, 110, 112, 114, 115, 118, 119, 122, 123, 124, 126, 95 or 96 in the 
plant, thereby increasing the biomass of the plant. 

According to still further features in the described preferred embodiments the 
plant is a monocot plant. 

According to still further features in the described preferred embodiments the 
plant is a dicot plant. 

According to a further aspect of the present invention there is provided a 
method of identifying genes which are involved in cotton fiber development, the 
method comprising: 

(a) providing expressed nucleic acid sequences derived from cotton fibers; 

(b) providing expressed nucleic acid sequences derived from an ovule 
tissue; 

(c) computationally assembling the expressed nucleic acid sequences of 
(a) and (b) to generate clusters; and 

(d) identifying clusters of the clusters which comprise expressed nucleic 
acid sequences of (a) and (b), thereby identifying genes which are 
involved in cotton fiber development. 
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According to still further features in the described preferred embodiments the 
method further comprising identifying genes which are differentially expressed in the 
cotton fiber following (d). 

According to still further features in the described preferred embodiments the 
differentially expressed comprises: 

(a) specific expression; and/or 

(b) change in expression over fiber development. 

According to yet an additional aspect of the present invention there is provided 
a method of producing an insect resistant plant, comprising regulating an expression 
level or activity of at least one polynucleotide encoding a polypeptide having an 
amino acid sequence at least 80 % homologous to SEQ ID NO: 26, 106, 107, 109, 
110, 112, 114, 115, 118, 119, 122, 123, 124, 126, 95 or 96 in a trichome of the plant, 
thereby producing the insect resistant plant. 

According to still an additional aspect of the present invention there is 
provided a method of producing cotton fibers, the method comprising: 

(a) generating a transgenic cotton plant expressing at least one polypeptide 
having an amino acid sequence at least 80 % homologous to SEQ ED 
NO: 26, 106, 107, 109, 110, 112, 114, 115, 118, 119, 122, 123, 124, 
126, 95 or 96; and 

(b) harvesting the fibers of the transgenic cotton plant, thereby producing 
the cotton fibers. 

The present invention successfully addresses the shortcomings of the presently 
known configurations by providing genes involved in cotton fiber development and 
methods of using same. 

Unless otherwise defined, all technical and scientific terms used herein have 
the same meaning as commonly understood by one of ordinary skill in the art to which 
this invention belongs. Although methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present invention, suitable 
methods and materials are described below. In case of conflict, the patent 
specification, including definitions, will control. In addition, the materials, methods, 
and examples are illustrative only and not intended to be limiting. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is herein described, by way of example only, with reference to 
the accompanying drawings. With specific reference now to the drawings in detail, it 
is stressed that the particulars shown are by way of example and for purposes of 
5 illustrative discussion of the preferred embodiments of the present invention only, and 
are presented in the cause of providing what is believed to be the most useful and 
readily understood description of the principles and conceptual aspects of the 
invention. In this regard, no attempt is made to show structural details of the invention 
in more detail than is necessary for a fundamental understanding of the invention, the 
10 description taken with the drawings making apparent to those skilled in the art how the 
several forms of the invention may be embodied in practice. 

In the drawings: 

FIG. 1 is an illustration depicting the bioinformatic methodology of the present 
invention effected to identify genes which may be used to improve cotton fiber yield 
15 and quality. 

FIGs. 2a-d are bar graphs showing expression patterns of fiber specific genes 
(CT_11 Figure 2b), elongation associated genes (CT_1, Figure 2c) and initiation 
associated genes (CT_22, Figure 2d). 

FIG. 3 is a graph depicting expression of CTJ76 in varieties of cotton (G. 
20 hirsutum vaf Tamcot, Coker and Acala, and G. barbadense var Pima S5) plants, as 
determined by RT-PCR. 

FIG. 4 is a schematic illustration of the pPi binary plasmid. 

FIGs. 5a-l are photographs of wild-type and transgenic arabidopsis plants over- 
expressing genes of the present invention. Figure 5a shows two week old rosette of wt 
25 plants; Figure 5b shows two week old rosette of CT11 over-expressing arabidopsis 
plants; Figure 5c shows two week old roots of CT1 1; Figure 5d shows three week old 
wild type arabidopsis; Figure 5e shows three week old CT_20; Figure 5f shows three 
week old CT_22; Figure 5g shows 30 days old rosettes of wt and CT_9; Figure 5h 
shows 30 days inflorescence of wt and CT_9; Figure 5i shows two week old roots of 
30 CT9; Figure 5j shows 30 days old rosettes of wt and CT_40; Figure 5k shows rosette 
of 5 week old wt and CT81 over-expressing plants; Figure 51 shows a leaf of wt and 
CT81 over-expressing arabidopsis plants; 
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FIGs. 6a-e are photographs depicting wild-type and transgenic tomato plants 
over-expressing CT 20. Figure 6a shows a leaf of wild-type plant; Figure 6b shows a 
leaf of CT 20 transgenic tomato; Figure 6c shows seed hairs of WT and CT_20 over- 
expressing tomato plants; Figure 6d shows section of a wt tomato seed; Figure 6e 
5 shows section of a CTJ20 over-expressing tomato seed; Figure 6f seed hairs of WT 
and CT_82. 

FIGs. 7a-b are photographs depicting transgenic tomato plants over-expressing 
GUS under the expression of the CTJ2 promoter. Figure 7a is a cut through transgenic 
tomato fruit, over-expressing GUS under CT2 promoter in the mature green stage (x 5 
10 magnification). Figure 7b similar to Figure 7a showing x 25 magnification; 

FIGs. 8a-b are photographs depicting various magnifications of wild-type and 
transgenic tomato fruits or tomato seeds. Figure 8a is a single wild type tomato seed 
covered with seed hairs x 10 magnification; Figure 8b shows tomato seed over 
expressing expansin under 35S (x 10 magnification). 

15 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention is of polypeptides and polynucleotides encoding same 

which are involved in plant fiber development and which can be used to improve fiber 

quality and/or yield/biomass of a fiber producing plant. 
20 The principles and operation of the presehi invention may be better understood 

with reference to the drawings and accompanying descriptions. 

Before explaining at least one embodiment of the invention in detail, it is to be 

understood that the invention is not limited in its application to the details set forth in 

the following description or exemplified by the Examples. The invention is capable of 
25 other embodiments or of being practiced or carried out in various ways. Also, it is to 

be understood that the phraseology and terminology employed herein is for the 

purpose of description and should not be regarded as limiting. 

Cotton and cotton by-products provide raw materials that are used to produce 

a wealth of consumer-based products; in addition to textiles, cotton is used to produce 
30 foodstuffs, livestock feed, fertilizer and paper. The production, marketing, 

consumption and trade of cotton-based products generate an excess of $100 billion 

annually in the U.S. alone, making cotton the number one value-added crop. 
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Over the past decade cotton fiber production has sharply declined prompting 
cotton growers and researchers to look for approaches, which can be used to improve 
fiber yield and quality. 

Increasing fiber quality and/or yield under diverse environmental conditions 
5 will increase the profitability of cotton crop production and provide a new spectrum of 
material properties for exploitation by the processing industries. 

While reducing the present invention to practice, the present inventors have 
configured a novel computational approach that utilizes comparative genomics to 
identify genes which play a role in fiber development. Genes identified using this 
10 approach may be successfully used for generating transgenic plants which are 
featured by fibers of desired properties. 

Thus, according to one aspect of the present invention there is provided a 
method of identifying genes which are involved in cotton fiber development. 

As used herein the term "cotton" refers to a wild-type, a cultivated variety 
(e.g., hybrid) or a transgenic cotton (Gossypium) plant. 

As used herein the phrase "fiber development" refers to the development of 
the hair of the cotton seed. ' 

As used herein the term "development" when used in context of cotton fibers 
refers to initiation of the fiber and/or elongation thereof, as well as to the fiber 
secondary cell wall thickening and maturation. 

The method according to this aspect of the present invention is effected by: 

(a) providing expressed nucleic acid sequences derived from cotton fibers; 

(b) providing expressed nucleic acid sequences derived from an ovule 
tissue (i.e., a tissue developed from an ovary of a seed plant. Examples include, but 
are not limited to, carpels, seed coat, embryo, endosperm); 

(c) computationally assembling the expressed nucleic acid sequences of 
(a) and (b) to generate clusters; and 

(d) identifying clusters of said clusters which comprise expressed nucleic 
acid sequences of (a) and (b), thereby identifying genes which are involved in cotton 
fiber development 

Expressed nucleic acid sequences used as a potential source for identifying 
genes involved in cotton fiber development according to this aspect of the present 
invention are preferably libraries of expressed messenger RNA [i.e., expressed 
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sequence tags (EST), cDNA clones, contigs, pre-mRNA, etc.] obtained from tissue or 
cell-line preparations which can include genomic and/or cDNA sequence. 

Expressed nucleic acid sequences, according to this aspect of the present 
invention can be retrieved from pre-existing publicly available databases (see 
5 Example 1 of the Examples section which follows or private databases). 

Alternatively, the expressed nucleic acid sequences utilized by the present 
invention can be generated from sequence libraries (e.g., cDNA libraries, EST 
libraries, mRNA libraries and others). 

cDNA libraries are suitable sources for expressed sequence information. 
10 Generating a sequence database in such a case is typically effected by tissue or 

cell sample preparation, RNA isolation, cDNA library construction and sequencing. 

It will be appreciated that such cDNA libraries can be constructed from RNA 
isolated from whole plant, specific tissues, or cell populations. 

Once expressed sequence data is obtained from both cotton fibers and an ovule 
15 tissue, sequences may be clustered to form contigs. See Example 1 of the Examples 
section which follows 

Such contigs are then assembled to identify homologous sequences (of cotton 
fibers and ovule tissue) present in the same cluster, such contigs are considered to be 
involved in cotton fiber development. 
20 A number of commonly used computer software fragment read assemblers 

capable of forming clusters of expressed sequences are commercially available. 
These packages include but are not limited to, The TIGR Assembler [Sutton G. et al. 
(1995) Genome Science and Technology 1:9-19], GAP [Bonfield JK. et al. (1995) 
Nucleic Acids Res. 23:4992-4999], CAP2 [Huang X. et al. (1996) Genomics 33:21- 
25 31], The Genome Construction Manager [Laurence CB. Et al. (1994) Genomics 
23:192-201], Bio Image Sequence Assembly Manager, SeqMan [Swindell SR. and 
Plasterer JN. (1997) Methods Mol. Biol. 70:75-89], LEADS and GenCarta 
(Compugen Ltd. Israel). 

Once genes which are involved in cotton fiber development are identified their 
pattern of expression can be analyzed as described in Example 2 of the Examples 
section which follows, to thereby identify genes which are differentially expressed in 
the cotton fiber (i.e., specific expression) or during cotton fiber development (i.e., 
change in expression during cotton fiber development). 
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Methods of identifying differentially expressed genes are well known in the 

art. 

Using the above methodology, the present inventors were able to successfully 
identify genes which are involved in cotton fiber development. 

As is illustrated in the Examples section which follows genes identified using 
the teachings of the present invention can be classified into 6 functional categories 
according to their sequence homology to known proteins and enzymes (Table 3, 
below). The Two genes were classified into a cell fate commitment category: 
homologous to the MYB transcription factor and to GL3 which are known to be 
involved in trichome development in arabidopsis. The expression pattern of both 
genes and the phenotype of CT20 transgene both in arabidopsis and tomato Tl plants 
support their involvement mainly in the initiation phase. Two other genes (Table 3, 
above) are transcription factors from the MYB and MADS BOX families. Many 
studies demonstrated the function of these two transcription factor families as 
homeotic genes with key role in different developmental processes, among them are 
trichome and fiber morphogenesis (Suo. J. et al 2003, Ferrario S et al 2004). Their 
role in early stages of fiber development is supported also by their RNA expression 
pattern, which, is induced before, and during the day of anthesis. One gene belongs to 
the pathways of starch and sucrose metabolism. A recent work demonstrates that 
another gene (SUS), which, belongs to this pathway, is a limiting factor in both fiber 
initiation and development Another gene (Table 3, below) is classified as lipid 
transport whose RNA expression is highly induced during early fiber elongation stage 
fit to the fact that lipids are key components in fiber formation. Several genes (Table 
3, below) were classified either as genes involved in desiccation, salinity response 
stimulated by abscisic acid and genes involved in electron transfer. Out of them 3 
genes were selected by RNA expression pattern to be induced in the elongation stage. 

In view of the above and together with the experimental results which 
correlate gene expression with fiber length, it is suggested that genes of the present 
invention can be used to generate fiber producing plants with commercially desired 
fiber quality. 

Thus, the present invention encompasses polynucleotides identified using the 
present methodology and their encoded polypeptide as well as functional equivalents 
of the polypeptides identified, herein (i.e., , polypeptides which are capable of 
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regulating cotton fiber development, as can be determined according to the assays 
described in the Examples section which follows) and their coding sequences. Such 
functional equivalents can be at least about 70 %, at least about 75 %, at least about 
80 %, at least about 81 %, at least about 82 %, at least about 83 %, at least about 84 
%, at least about 85 %, at least about 86 %, at least about 87 %, at least about 88 %, at 
least about 89 %, at least about 90 %, at least about 91 %, at least about 92 %, at least 
about 93 %, at least about 94 %, at least about 95 %, at least about 75 %, at least 
about 75 %, at least about 75 %, at least about 75 %, say 100 % homologous to SEQ 
ID NO: 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 
121, 122, 123, 124, 125, 126, 95 or 96. 

Polynucleotides encoding functional equivalents can be at least about 70 %, at 
least about 75 %, at least about 80 %, at least about 81 %, at least about 82 %, at least 
about 83 %, at least about 84 %, at least about 85 %, at least about 86 %, at least 
about 87 %, at least about 88 %, at least about 89 %, at least about 90 %, at least 
about 91 %, at least about 92 %, at least about 93 %, at least about 94 %, at least 
about 95 %, at least about 75 %, at least about 75 %, at least about 75 %, at least 
about 75 %, say 100 % identical to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 16, 
17, 18, 19, 20, 21, 22, 23, 24, 25 or 27. 

Homology (e.g., percent homology) can be determined using any homology 
comparison software, including for example, the BlastP software of the National 
Center of Biotechnology Information (NCBI) such as by using default parameters. 

Identity (e.g., percent homology) can be determined using any homology 
comparison software, including for example, the BlastN software of the National 
Center of Biotechnology Information (NCBI) such as by using default parameters. 

As used herein the phrase "an isolated polynucleotide" refers to a single or 
double stranded nucleic acid sequences which is isolated and provided in the form of 
an RNA sequence, a complementary polynucleotide sequence (cDNA), a genomic 
polynucleotide sequence and/or a composite polynucleotide sequences (e.g., a 
combination of the above). 

As used herein the phrase "complementary polynucleotide sequence" refers to 
a sequence, which results from reverse transcription of messenger RNA using a 
reverse transcriptase or any other RNA dependent DNA polymerase. Such a sequence 



) 
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can be subsequently amplified in vivo or in vitro using a DNA dependent DNA 
polymerase. 

As used herein the phrase "genomic polynucleotide sequence" refers to a 
sequence derived (isolated) from a chromosome and thus it represents a contiguous 
5 portion of a chromosome. 

As used herein the phrase "composite polynucleotide sequence" refers to a 
sequence, which is at least partially complementary and at least partially genomic. A 
composite sequence can include some exonal sequences required to encode the 
polypeptide of the present invention, as well as some intronic sequences interposing 
10 therebetween. The intronic sequences can be of any source, including of other genes, 
and typically will include conserved splicing signal sequences. Such intronic 
sequences may further include cis acting expression regulatory elements. 

According to a preferred embodiment of this aspect of the present invention, 
the nucleic acid sequence is as set forth in SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 
13, 14, 15, 16, 17, 19, 21, 22, 23, 24, 25 or 26, 

According to another preferred embodiment of this aspect of the present 
invention, the isolated polynucleotide is as set forth in SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 16, 17, 18, 19, 20,21,22, 23,24, 25 or 27. 

According to yet another preferred embodiment of this aspect of the present 
invention, the polypeptide is as set forth in SEQ 106, 107, 108, 109, 110, 111, 

112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 95 or 96. 

According to still another preferred embodiment of this aspect of the present 
invention, the amino acid sequence is as set forth in SEQ ID NO: 106, 107, 108, 109, 
110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 
95 or 96. 

The isolated polynucleotides of this aspect of the present invention can also be 
qualified using a hybridization assay by incubating the isolated polynucleotides 
15 described above in the presence of oligonucleotide probe or primer under moderate to 
stringent hybridization conditions. 

Moderate to stringent hybridization conditions are characterized by a 
hybridization solution such as containing 10 % dextrane sulfate, 1 M NaCl, 1 % SDS 

and 5 x 10 6 cpm 32 P labeled probe, at 65 °C, with a final wash solution of 0.2 x SSC 
20 and 0.1 % SDS and final wash at 65°C and whereas moderate hybridization is 
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effected using a hybridization solution containing 10 % dextrane sulfate, 1 M NaCI, 1 
% SDS and 5 x 10 6 cpm 32 P labeled probe, at 65 °C, with a final wash solution of 1 x 
SSC and 0. 1 % SDS and final wash at 50 °C. 

Thus, the present invention encompasses nucleic acid sequences described 
5 hereinabove; fragments thereof, sequences hybridizable therewith, sequences 
homologous thereto, sequences encoding similar polypeptides with different codon 
usage, altered sequences characterized by mutations, such as deletion, insertion or 
substitution of one or more nucleotides, either naturally occurring or man induced, 
either randomly or in a targeted fashion. 

10 Since the polynucleotide sequences of the present invention encode previously 

unidentified polypeptides, the present invention also encompasses novel polypeptides 
or portions thereof, which are encoded by the isolated polynucleotides and respective 
nucleic acid fragments thereof described hereinabove. 

Thus, the present invention also encompasses polypeptides encoded by the 

15 polynucleotide sequences of the present invention. The amino acid sequences of 
these novel polypeptides are set forth in SEQ ID NO: 26, 106, 107, 109, 110, 112, 
114, 115, 118, 119, 122, 123, 124, 126, 95 or 96. 

The present invention also encompasses homologues of these polypeptides, 
such homologues can be at least about 70 %, at least about 75 %, at least about 80 %, 

20 at least about 81 %, at least about 82 %, at least about 83 %, at least about 84 %, at 
least about 85 %, at least about 86 %, at least about 87 %, at least about 88 %, at least 
about 89 %, at least about 90 %, at least about 91 %, at least about 92 %, at least 
about 93 %, at least about 93 %, at least about 94 %, at least about 95 %, at least 
about 96 %, at least about 97 %, at least about 98 %, at least about 99 %, or more say 

25 100 % homologous to SEQ ID NO: 26, 106, 107, 109, 110, 112, 114, 115, 118, 119, 
122, 123, 124, 126, 95 or 96. 

The present invention also encompasses fragments of the above described 
polypeptides and polypeptides having mutations, such as deletions, insertions or 
substitutions of one or more amino acids, either naturally occurring or man induced, 

30 either randomly or in a targeted fashion. 

The ability of polynucleotides of the present invention and their products to 
regulate cotton fiber development can be determined directly on at least one structural 
parameter of a cotton fiber such as fiber length or fiber finesse, or fiber growth rate 
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(further described hereinbelow). However cotton fiber development can also 
determined indirectly such as by plant model systems for cotton fiber development. 
For example, its is well established that trichome cells and root hairs share common 
characteristics with cotton fiber cells, and as such can be used as model systems for 
cotton fiber development [Reviewed in Wagner. GJ. et al. (2004)], as demonstrated 
in details in Example 12 of the Examples section which follows. 

By analyzing expression profiles, the present inventors were able to determine 
the involvement of the biomolecular sequences (i.e., polynucleotides and 
polypeptides) of the present invention in fiber initiation and/or elongation. These 
results were further substantiated by establishing a correlation between gene 
expression and fiber length (see Example 7). 

These results suggest that biomolecular sequences of the present invention 
(e.g., polynucleotides, polypeptides, promoters, oligonucleotides, antibodies, also 
referred to herein as agents) can be used to improve fiber quality and/or yield of a 
fiber producing plant. 

Thus, according to yet another aspect of the present invention there is 
provided a method of improving fiber quality and/or yield of a fiber producing plant. 

The method of this aspect of the present invention is effected by regulating an 
expression level or activity of at least one polynucleotide or polypeptide of the present 
invention (described hereinabove) in the fiber producing plant, thereby improving the 
quality and/or yield of the fiber producing plant 

As used herein the phrase "fiber producing plant" refers to plants that share the 
common feature of having an elongated shape and abundant cellulose in thick cell 
walls, typically termed as secondary walls. Such walls may or may not be lignified, 
and the protoplast of such cells may or may be viable at maturity. Such fibers have 
many industrial uses, for example in lumber and manufactured wood products, paper, 
textiles, sacking and boxing material, cordage, brushes and brooms, filling and 
stuffing, caulking, reinforcement of other materials, and manufacture of cellulose 
derivatives. 

According to a preferred embodiment of this aspect of the present invention 
the fiber producing plant is cotton. 

The term "fiber" is usually inclusive of thick-walled conducting cells such as 
vessels and tracheids and to fibrillar aggregates of many individual fiber cells. Hence, 
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the term "fiber" refers to (a) thick-walled conducting and non-conducting cells of the 
xylem; (b) fibers of extraxylary origin, including those from phloem, bark, ground 
tissue, and epidermis; and (c) fibers from stems, leaves, roots, seeds, and flowers or 
inflorescences (such as those of Sorghum vulgare used in the manufacture of brushes 
and brooms). 

Example of fiber producing plants, include, but are not limited to, agricultural 
crops such as cotton, silk cotton tree (Kapok, Ceiba pentandra), desert willow, 
creosote bush, winterfat, balsa, kenaf, roselle, jute, sisal abaca, flax, com, sugar cane, 
hemp, ramie, kapok, coir, bamboo, Spanish moss and Agave spp. (e.g. sisal). 

As used herein the phrase "fiber quality" refers to at least one fiber parameter 
which is agriculturally desired, or required in the fiber industry (further described 
hereinbelow). Examples of such parameters, include but are not limited to, fiber 
length, fiber strength, fiber fitness, fiber weight per unit length, maturity ratio and 
uniformity (further described hereinbelow. 

Cotton fiber (lint) quality is typically measured according to fiber length, 
strength and fineness. Accordingly, the lint quality is considered higher when the 
fiber is longer, stronger and finer. 

As used herein the phrase "fiber yield" refers to the amount or quantity of 
fibers produced from the fiber producing plant. 

As used herein the term "improving" refers to at least about 5 %, at least about 
10 %, at least about 15 %, at least about 20 %, at least about 30 %, at least about 40 
%, at least about 50 %, change in fiber quality/yield as compared to a native plant 
(i.e., not modified with the biomolecular sequences of the present invention). 

As used herein the term "regulating" refers to up regulating, down regulating 
or a combination thereof. For example, when an increase in fiber number is desired 
the present invention can be effected by upregulating at least one polynucleotide of 
the present invention, which is involved in fiber initiation (e.g., SEQ ID NOs: 4, 10, 9, 
12, 16 and 25). Alternatively, when short fibers are desired such as for example, in 
corn, then the present invention is effected by down regulating at least one 
polynucleotide of the present invention which is involved in fiber elongation (e.g., 
SEQ ID NOs. 1, 2, 3, 5, 6, 7, 17, 18, 19, 20, 21, 22, 23, 24 and 27). Alternatively, the 
present invention can be effected by upregulating expression of at least one 
polynucleotide (such as involved in fiber elongation) and down regulating at least one 
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polynucleotide (such as involved in fiber initiation) of the polynucleotides of the 
present invention. In this manner it is feasible to obtain a fiber producing plant with 
improved fiber yield of each of short length. 

Up regulating an expression level of at least one of the polynucleotides of the 
5 present invention can be effected at the genomic level (e.g., activation of transcription 
by means of promoters, enhancers, or other regulatory elements), at the transcript 
level, or at the protein level. 

Following is a non-comprehensive list of agents capable of upregulating the 
expression level and/or activity of the biomolceular sequences (i.e., nucleic acid or 
10 protein sequences) of the present invention. \ 

An agent capable of upregulating expression of a polynucleotide of interest 
may be an exogenous polynucleotide sequence designed and constructed to express at 
least a functional portion thereof (e.g., improving fiber yield/quality, increasing 
biomass etc.). Accordingly, the exogenous polynucleotide sequence may be a DNA or 
15 RNA sequence encoding a polypeptide molecule, capable of improving fiber yield or 
quantity. Alternatively, the exogenous polynucleotide may be a cis-acting regulatory 
region (e.g., SEQ ID NO: 74, 75, 85, 88 or 91) which may be introduced into the plant 
to increase expression of any polynucleotide which is involved in fiber development 
(e.g., sucrose phosphate synthase, as described in U.S. Pat. No. 6,472,588). 
20 To express exogenous polynucleotides in plant cells, a polynucleotide 

sequence of the present invention is preferably ligated into a nucleic acid construct 
suitable for plant cell expression. Such a nucleic acid construct includes a cis-acting 
regulatory region such as a promoter sequence for directing transcription of the 
polynucleotide sequence in the cell in a constitutive or inducible manner. The 
25 promoter may be homologous or heterologous to the transformed plant/cell. 

Preferred promoter sequences which can be used in accordance with this 
aspect of the present invention are endothelial cell promoters. 

For example, promoter sequences of each of the polynucleotide sequences of 
the present invention may be preferably used in the nucleic acid constructs of the 
30 present invention. 

According to a preferred embodiment of this aspect of the present invention 
the promoter is at least about 80 %, at least about 81 %, at least about 82 %, at lelast 
about 83 %, at least about 84 %, at least about 85 %, at least about 86 %, at least 
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about 87 %, at least about 88 %, at least about 89 %, at least about 90 %, at least 
about 91 %, at least about 92 %, at least about 93 %, at least about 94 %, at least 
about 95 %, at least about 96 %, at least about 97 %, at least about 98 %, at least 
about 99 %, or 100 % identical to SEQ ID NO. 85 or 91, which is capable of 
5 regulating expression of at least one polynucleotide sequence operably linked thereto 
in an ovule endothelial cell (i.e., capable of exerting a regulatory effect on the coding 
sequence linked thereto). 

As is clearly illustrated in the Examples section which follows, such promoter 
sequences are capable of regulating expression of a coding nucleic acid sequence 

10 (e.g., GUS) operably linked thereto. 

Other examples of cotton fiber-enhanced promoters include those of the cotton 
fiber-expressed genes E6 (John et al., Plant Mol. Biol., 30:297-306 (1996) and John et 
al., Proc. Natl. Acad. Sci., 93:12768-12773 (1996) e), H6 (John et al., Plant Physiol., 
108:669-676, (1995)), FbL2A (Rinehart et al., Plant Physiol., 112:1331-1341 (1996) 

15 and John et al, Proc. Natl. AcadJ Sci. USA, 93:12768-12773 (1996)), rac (Delmer et 
al., Mol. Gen. Genet, 248:43-51 (1995)); CelA (Pear et al., Proc. Natl. Acad. Sci 
USA, 93:12637-12642 (1996)); CAP (Kawai et al., Plant Cell Physiol. 39:1380-1383 
(1998)); ACP (Song et al., Biochim. Biophys. Acta 1351:305-312 (1997); and LTP 
(Ma et al., Biochim. Biophys. Acta 1344:111-114 (1997)). Other cotton fiber 

20 specific promoters are disclosed in U.S. Pat No. 5,495,070. 

Other promoters which can be used in accordance with this aspect of the 
present invention are those that ensure expression only in specified organs, such as the 
leaf, root, tuber, seed, stem, flower or specified cell types such as parenchyma, 
epidermal, trichome or vascular cells. 

25 Preferred promoters for enhancing expression in trichome cells are disclosed 

in WO 2004/1 1 1 183, to Evogene Ltd. 

Preferred promoters enhancing expression in vascular tissue include the CAD 
2 promoter (Samaj et al., Planta, 204:437-443 (1998)), the Pt4Cll promoter (Hu et 
al., Proc. Natl. Acad. Sci. USA, 95:5407-5412 (1998)), the C4H promoter (Meyer et 

30 al., Proc. Natl. Acad. Sci. USA, 95:6619-6623 (1998)), the PtX3H6 and PtX14A9 
promoters (Loopstra et al., Plant Mol. Biol., 27:277-291 (1995)), the RolC promoter 
(Graham, Plant Mol. Biol., 33:729-735 (1997)), the Hvhspl7 promoter (Raho et al., J. 
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Expt Bot., 47:1587-1594 (1996)), and the COMT promoter (Capellades et aL, Plant 
Mol. Biol., 31:307-322 (1996)). 

Preferred promoters enhancing expression in stem tissue include pith 
promoters (Datta, Theor. Appl. Genet., 97:20-30 (1998) and Ohta et aL, Mol. Gen. 
5 Genet, 225:369-378 (1991)), and the anionic peroxidase promoter (Klotz et al., Plant 
MoL Biol., 36:509-520 (1998)). Preferred promoters enhancing expression in phloem, 
cortex and cork, but not xylem or pith, include the Psam-1 promoter (Mijnsbrugge et 
al., Plant and Cell Physiol., 37:1108-1115 (1996)). 

Preferred promoters enhancing expression in seeds include the phas promoter 
10 (Geest et al, Plant MoL Biol. 32:579-588 (1996)); the GluB-1 promoter (Takaiwa et 
al., Plant Mol. Biol. 30:1207-1221 (1996)); the gamma-zein promoter (Torrent et al. 
Plant MoL Biol. 34:139-149 (1997)), and the oleosin promoter (Sarmiento et al., The 
Plant Journal 11:783-796 (1997)). 

Other promoter sequences which mediate constitutive, inducible, tissue- 
15 specific or developmental stage-specific expression are disclosed in WO 2004/081 173 
to Evogene Ltd. 

Truncated or synthetic promoters including specific nucleotide regions 
conferring tissue-enhanced expression may also be used, as exemplified by 
identification of regulatory elements within larger promoters conferring xylem- 

20 enhanced expression (Seguin et aL, Plant Mol. BioL, 35:281-291 (1997); Torres- 
Schumann et aL, The Plant Journal, 9:283-296 (1996); and Leyva et al., The Plant 
Cell, 4:263-271 (1992)). 

The nucleic acid construct can be, for example, a plasmid, a bacmid, a 
phagemid, a cosmid, a phage, a virus or an artificial chromosome. Preferably, the 

25 nucleic acid construct of the present invention is a plasmid vector, more preferably a 
binary vector. 

The phrase "binary vector" refers to an expression vector which carries a 
modified T-region from Ti plasmid, enable to be multiplied both in E. coli and in 
Agrobacterium cells, and usually comprising reporter gene(s) for plant transformation 
30 between the two boarder regions. A binary vector suitable for the present invention 
includes pBI2113, pBI121, pGA482, pGAH, pBIG, pBHOl (Clonetech), pPI (see 
Example 5 of the Examples section which follows) or modifications thereof. 
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The nucleic acid construct of the present invention can be utilized to transform 
a host cell (e.g., bacterial, plant) or plant. 

As used herein, the terms "transgenic" or "transformed" are used 
interchangeably referring to a cell or a plant into which cloned genetic material has 
5 been transferred. 

In stable transformation, the nucleic acid molecule of the present invention is 
integrated into the plant genome, and as such it represents a stable and inherited trait. 
In transient transformation, the nucleic acid molecule is expressed by the cell 
transformed but not integrated into the genome, and as such represents a transient 
10 trait 

There are various methods of introducing foreign genes into both 
monocotyledonous and dicotyledonous plants (Potrykus, I. (1991). Annu Rev Plant 
Physiol Plant Mol Biol 42, 205-225; Shimamoto, K. et al. (1989). Fertile transgenic 
rice plants regenerated from transformed protoplasts. Nature (1989) 338, 274-276). 
15 The principal methods of the stable integration of exogenous DNA into plant 

genomic DNA includes two main approaches: 

(i) Agrobacterium-mediated gene transfer. See: Klee, H. J. et al. (1987). Annu 
Rev Plant Physiol 38, 467-486; Klee, H. J. and Rogers, S. G. (1989). Cell Culture and 
Somatic Cell Genetics of Plants, Vol. 6, Molecular Biology of Plant Nuclear Genes, 

20 pp. 2-25, J. Schell and L. K. Vasil, eds., Academic Publishers, San Diego, CaL; and 
Gatenby, A. A. (1989). Regulation and Expression of Plant Genes in Microorganisms, 
pp. 93-112, Plant Biotechnology, S. Kung and C. J. Arntzen, eds., Butterworth 
Publishers, Boston, Mass. 

(ii) Direct DNA uptake. See, e.g.: Paszkowski, J. et al. (1989). Cell Culture 
25 and Somatic Cell Genetics of Plants, Vol. 6, Molecular Biology of Plant Nuclear 

Genes, pp. 52-68, J. Schell and L. K. Vasil, eds., Academic Publishers, San Diego, 
CaL; and Toriyama, K. et al. (1988). Bio/Technol 6, 1072-1074 (methods for direct 
uptake of DNA into protoplasts). See also: Zhang et al. (1988). Plant Cell Rep 7, 379- 
384; and Fromm, M. E. et al. (1986). Stable transformation of maize after gene 
30 transfer by electroporation. Nature 319, 791-793 (DNA uptake induced by brief 
electric shock of plant cells). See also: Klein et al. (1988). Bio/Technology 6, 559- 
563; McCabe, D. E. et al. (1988). Stable transformation of soybean (Glycine max) by 
particle acceleration. Bio/Technology 6, 923-926; and Sanford, J. C. (1990). Biolistic 
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plant transformation. Physiol Plant 79, 206-209 (DNA injection into plant cells or 
tissues by particle bombardment). See also: Neuhaus, J. M. et al. (1987). Theor Appl 
Genet 75, 30-36; and Neuhaus, J. M. and Spangenberg, G. C. (1990). Physiol Plant 
79, 213-217 (use of micropipette systems). See U.S. Pat. No. 5,464,765 (glass fibers 
5 or silicon carbide whisker transformation of cell cultures, embryos or callus tissue). 
See also: DeWet, J. M. J. et al. (1985). "Exogenous gene transfer in maize (Zea mays) 
using DNA-treated pollen," Experimental Manipulation of Ovule Tissue, G. P. 
Chapman et al., eds., Longman, New York-London, pp. 197-209; and Ohta, Y. 
(1986). High-EfiBciency Genetic Transformation of Maize by a Mixture of Pollen and 

10 Exogenous DNA. Proc Natl Acad Sci USA 83, 715-719 (direct incubation of DNA 
with germinating pollen). 

The Agrobacterium-mediated system includes the use of plasmid vectors that 
contain defined DNA segments which integrate into the plant genomic DNA. 
Methods of inoculation of the plant tissue vary depending upon the plant species and 

15 the Agrobacterium delivery system. A widely used approach is the leaf-disc 
procedure, which can be performed with any tissue explant that provides a good 
source for initiation of whole-plant differentiation (Horsch, R. B. et al. (1988). "Leaf 
disc transformation." Plant Molecular Biology Manual A5 9 1-9, Kluwer Academic 
Publishers, Dordrecht). A supplementary approach employs the Agrobacterium 

20 delivery system in combination with vacuum infiltration. The Agrobacterium system 
is especially usefid for in the creation of transgenic dicotyledenous plants. 

There are various methods of direct DNA transfer into plant cells. In 
electroporation, the protoplasts are briefly exposed to a strong electric field, opening 
up mini-pores to allow DNA to enter. In microinjection, the DNA is mechanically 

25 injected directly into the cells using micropipettes. In microparticle bombardment, the 
DNA is adsorbed on microprojectiles such as magnesium sulfate crystals or tungsten 
particles, and the microprojectiles are physically accelerated into cells or plant tissues. 

Following stable transformation, plant propagation occurs. The most common 
method of plant propagation is by seed. The disadvantage of regeneration by seed 

30 propagation, however, is the lack of uniformity in the crop due to heterozygosity, 
since seeds are produced by plants according to the genetic variances governed by 
Mendelian rules. In other words, each seed is genetically different and each will grow 
with its own specific traits. Therefore, it is preferred that the regeneration be effected 



WO 2005/121364 



PCT/IL2005/000627 



24 

such that the regenerated plant has identical traits and characteristics to those of the 
parent transgenic plant. The preferred method of regenerating a transformed plant is 
by micropropagation, which provides a rapid, consistent reproduction of the 
transformed plants. 

5 Micropropagation is a process of growing second-generation plants from a 

single tissue sample excised from a selected parent plant or cultivar. This process 
permits the mass reproduction of plants having the preferred tissue and expressing a 
fusion protein. The newly generated plants are genetically identical to, and have all of 
the characteristics of, the original plant. Micropropagation allows for mass production 

10 of quality plant material in a short period of time and offers a rapid multiplication of 
selected cultivars with preservation of the characteristics of the original transgenic or 
transformed plant. The advantages of this method of plant cloning include the speed 
of plant multiplication and the quality and uniformity of the plants produced. 

Micropropagation is a multi-stage procedure that requires alteration of culture 

15 medium or growth conditions between stages. The micropropagation process involves 
four basic stages: stage one, initial tissue culturing; stage two, tissue culture 
multiplication; stage three, differentiation and plant formation; and stage four, 
greenhouse culturing and hardening. During stage one, the tissue culture is established 
and certified contaminant-free. During stage two, the initial tissue culture is 

20 multiplied until a sufficient number of tissue samples are produced to meet production 
goals. During stage three, the newly grown tissue samples are divided and grown into 
individual plantlets. At stage four, the transformed plantlets are transferred to a 
greenhouse for hardening where the plants' tolerance to light is gradually increased so 
that they can continue to grow in the natural environment. 

25 Although stable transformation is presently preferred, transient transformation 

. of, for instance, leaf cells, meristematic cells, or the whole plant is also envisaged by 
the present invention. 

Transient transformation can be effected by any of the direct DNA transfer 
methods described above or by viral infection using modified plant viruses. 

30 Viruses that have been shown to be useful for the transformation of plant hosts 

include cauliflower mosaic virus (CaMV), tobacco mosaic virus (TMV), and 
baculo virus (BV). Transformation of plants using plant viruses is described in, for 
example: U.S. Pat. No. 4,855,237 (bean golden mosaic virus, BGMV); EPA 67,553 
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(TMV); Japanese Published Application No. 63-14693 (TMV); EPA 194,809 (BV); 
EPA 278,667 (BV); and Gluzman, Y. et al. (1988). Communications in Molecular 
Biology: Viral Vectors, Cold Spring Harbor Laboratory, New York, pp. 172-189. The 
use of pseudovirus particles in expressing foreign DNA in many hosts, including 
5 plants, is described in WO 87/0626 1 . 

Construction of plant RNA viruses for the introduction and expression of non- 
viral exogenous nucleic acid sequences in plants is demonstrated by the above 
references as well as by: Dawson, W. O. et al, (1989). A tobacco mosaic virus-hybrid 
expresses and loses an added gene. Virology 772, 285-292; French, R. et al. (1986) 
10 Science 231, 1294-1297; and Takamatsu, N. et al. (1990). Production of enkephalin in 
tobacco protoplasts using tobacco mosaic virus RNA vector. FEBS Lett 269, 13-16. 

If the transforming virus is a DNA virus, one skilled in the art may make 
suitable modifications to the virus itself. Alternatively, the virus can first be cloned 
into a bacterial plasmid for ease of constructing the desired viral vector with the 
15 foreign DNA. The virus can then be excised from the plasmid. If the virus is a DNA 
virus, a bacterial origin of replication can be attached to the viral DNA, which is then 
replicated by the bacteria. Transcription and translation of the DNA will produce the 
coat protein, which will encapsidate the viral DNA. If the virus is an RNA virus, the 
virus is generally cloned as a cDNA and inserted into a plasmid. The plasmid is then 
20 used to make all of the plant genetic constructs. The RNA virus is then transcribed 
from the viral sequence of the plasmid, followed by translation of the viral genes to 
produce the coat proteins which encapsidate the viral RNA. 

Construction of plant RNA viruses for the introduction and expression in 
plants of non-viral exogenous nucleic acid sequences, such as those included in the 
25 construct of the present invention, is demonstrated in the above references as well as 
in U.S. Pat No. 5,316,931. 

In one embodiment, there is provided for insertion a plant viral nucleic acid, 
comprising a deletion of the native coat protein coding sequence from the viral 
nucleic acid, a non-native (foreign) plant viral coat protein coding sequence, and a 
30 non-native promoter, preferably the subgenomic promoter of the non-native coat 
protein coding sequence, and capable of expression in the plant host, packaging of the 
recombinant plant viral nucleic acid, and ensuring a systemic infection of the host by 
the recombinant plant viral nucleic acid. Alternatively, the native coat protein coding 
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sequence may be made non-transcribable by insertion of the non-native nucleic acid 
sequence within it, such that a non-native protein is produced. The recombinant plant 
viral nucleic acid construct may contain one or more additional non-native 
subgenomic promoters. Each non-native subgenomic promoter is capable of 
5 transcribing or expressing adjacent genes or nucleic acid sequences in the plant host 
and incapable of recombination with each other and with native subgenomic 
promoters. In addition, the recombinant plant viral nucleic acid construct may contain 
one or more c/s-acting regulatory elements, such as enhancers, which bind a trans- 
acting regulator and regulate the transcription of a coding sequence located 

10 downstream thereto. Non-native nucleic acid sequences may be inserted adjacent to 
the native plant viral subgenomic promoter or the native and non-native plant viral 
subgenomic promoters if more than one nucleic acid sequence is included. The non- 
native nucleic acid sequences are transcribed or expressed in the host plant under 
control of the subgenomic promoters) to produce the desired products. 

15 In a second embodiment, a recombinant plant viral nucleic acid construct is 

provided as in the first embodiment except that the native coat protein coding 
sequence is placed adjacent to one of the non-native coat protein subgenomic 
promoters instead of adjacent to a non-native coat protein coding sequence. 

In a third embodiment, a recombinant plant viral nucleic acid construct is 

20 provided comprising a native coat protein gene placed adjacent to its subgenomic 
promoter and one or more non-native subgenomic promoters inserted into the viral 
nucleic acid construct. The inserted non-native subgenomic promoters are capable of 
transcribing or expressing adjacent genes in a plant host and are incapable of 
recombination with each other and with native subgenomic promoters. Non-native 

25 nucleic acid sequences may be inserted adjacent to the non-native subgenomic plant 
viral promoters such that said sequences are transcribed or expressed in the host plant 
under control of the subgenomic promoters to produce the desired product. 

In a fourth embodiment, a recombinant plant viral nucleic acid construct is 
provided as in the third embodiment except that the native coat protein coding 

30 sequence is replaced by a non-native coat protein coding sequence. 

Viral vectors are encapsidated by expressed coat proteins encoded by 
recombinant plant viral nucleic acid constructs as described hereinabove, to produce a 
recombinant plant virus. The recombinant plant viral nucleic acid construct or 
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recombinant plant virus is used to infect appropriate host plants. The recombinant 
plant viral nucleic acid construct is capable of replication in a host, systemic spread 
within the host, and transcription or expression of one or more foreign genes (isolated 
nucleic acid) in the host to produce the desired protein. 
5 In addition to the above, the nucleic acid molecule of the present invention can 

also be introduced into a chloroplast genome thereby enabling chloroplast expression. 

A technique for introducing exogenous nucleic acid sequences to the genome 
of the chloroplasts is known. This technique involves the following procedures. First, 
plant cells are chemically treated so as to reduce the number of chloroplasts per cell to 

10 about one. Then, the exogenous nucleic acid is introduced into the cells preferably via 
particle bombardment, with the aim of introducing at least one exogenous nucleic acid 
molecule into the chloroplasts. The exogenous nucleic acid is selected by one 
ordinarily skilled in the art to be capable of integration into the chloroplast's genome 
via homologous recombination, which is readily effected by enzymes inherent to the 

15 chloroplast. To this end, the exogenous nucleic acid comprises, in addition to a gene 
of interest, at least one nucleic acid sequence derived from the chloroplasts genome. 
In addition, the exogenous nucleic acid comprises a selectable marker, which by 
sequential selection procedures serves to allow an artisan to ascertain that all or 
substantially all copies of the chloroplast genome following such selection include the 

20 exogenous nucleic acid. Further details relating to this technique are found in U.S. 
Pat. Nos. 4,945,050 and 5,693,507, which are incorporated herein by reference. A 
polypeptide can thus be produced by the protein expression system of the chloroplast 
and become integrated into the chloroplasts inner membrane, 

Downregulation of a gene of interest can be effected on the genomic and/or 

25 the transcript level using a variety of molecules that interfere with transcription and/or 
translation (e.g., antisense, siRNA), or on the protein level using, e.g., antibodies, 
immunization techniques and the like. 

For example, an agent capable of downregulating an activity of a polypeptide 
of interest is an antibody or antibody fragment capable of specifically binding a 

30 polypeptide of the present invention. Preferably, the antibody specifically binds at 
least one epitope of the polypeptide of interest As used herein, the term "epitope" 
refers to any antigenic determinant on an antigen to which the paratope of an antibody 
binds. 
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Down -regulation at the RNA level can be effected by RNA-based silencing 
strategies which are effective in plants. See for example, Kusaba (2004) RNA 
interference in crop plants. Curr. Opin. Biotechnol. 15(2): 139-43; Matzke (2001) 
RNA based silencing strategies in plants. Curr. Opin. Genet. 1 1 :221-7. 
5 For example, an agent capable of downregulating a polynucleotide of interest 

is a small interfering RNA (siRNA) molecule in the process of RNA interference 
(RNAi). 

dsRNAs can be delivered to plants in several ways (reviewed in Waterhouse 
P, Helliwell C. 2003. Exploring plant genomes by RNA-induced gene silencing. 

10 Nature Genet 4: 29-38): microprojectile bombardment with dsRNA or intron- 
containing hairpin RNA (ihpRNA)-expressing vectors; infiltration of plant tissue with 
an Agrobacterium strain carrying a T-DNA expressing an ihpRNA transgene; virus 
induced gene silencing (VTGS), in which the target sequence is integrated into viral 
sequences which are used to infect the plant, or are expressed from Agrobacterium- 

15 introduced transgenes, and by stable transformation with ihpRNA expressing 
transgenes. The various RNAi techniques each have advantages and disadvantages 
with respect to how persistent their effect is and the range of plants to which they can 
be applied, e.g. bombardment can be applied to any plant, but produces only transient 
effects. Alternatively, transformation with ihpRNA-expressing transgenes provides 

20 stable and heritable gene silencing, but requires efficient plant transformation 
techniques. ihpRNA transgenes have been shown to be very effective for a wide range 
of target genes in various plant species (reviewed in Waterhouse P, Helliwell C. 2003. 
Exploring plant genomes by RNA-induced gene silencing. Nature Genet 4: 29-38; 
Wesley S, Helliwell C, Smith N, et al. 2001. Construct design for efficient, effective 

25 and high-throughput gene silencing in plants. Plant J27: 581-590), indicating that the 
RNAi mechanism is probably conserved in all plant species. This is supported by a 
recent report of RNAi in the non- vascular moss Physcomitrella patens (Bezanilla M, 
Pan A, Quatrano R. 2003. RNA interference in the moss Physcomitrella patens. Plant 
Physiol 133: 470-474). 

30 Antisense genetic constructs for fiber specific promoters (e.g., for SEQ ID 

NO: 85, 91) can be used to inhibit or lessen the expression of one or more fiber genes 
in fiber cells. The use of antisense constructs is described in U.S. Pat. No. 5,495^070 
and in Smith, et al. Nature 334 724-726, 1988; Bird, et al. Bio/Technology 9: 635- 
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639, 1991; Van der Krol, et al. Gene 72: 45-50, 1988. 

It will be appreciated that the generation of fiber producing plant of desired 
traits according to the present invention can also be effected by crossing each of the 
above genetically modified plants with wild type, hybrid or transgenic plants, using 

5 methods which are well known in the art. 

Once the transgenic planta of the present invention are generated, fibers are 
harvested (for example by mechanical picking and/or hand-stripping) and fiber yield 
and quality is determined. 

The following describes methods of qualifying cotton fibers. 

10 Fiber length - Instruments such as a fibrograph and HVI (high volume 

instrumentation) systems are used to measure the length of the fiber. HVI instruments 
compute length in terms of "mean" and "upper half mean" (UHM) length. The mean 
is the average length of all the fibers while UHM is the average length of the longer 
half of the fiber distribution. 

15 Fiber strength - As mentioned, fiber strength is usually defined as the force 

required to break a bundle of fibers or a single fiber. In HVI testing the breaking force 
is converted to "grams force per tex unit." This is the force required to break a bundle 
of fibers that is one tex unit in size. In HVI testing the strength is given in grams per 
tex units (grams/tex). Fibers can be classified as low strength (e.g., 19-22 gms/tex), 

20 average strength (e.g., 23-25 gms/tex), high strength (e.g., 26-28 gms/tex), and very 
high strength (e.g., 29-36 gms/tex). 

Micronaire - The micronaire reading of a fiber is obtained from a porous air 
flow test. The test is conducted as follows. A weighed sample of cotton is compressed 
to a given volume and controlled air flow is passed through the sample. The resistance 

25 to the air flow is read as micronaire units. The micronaire readings reflects a 
combination of maturity and fineness. Since the fiber diameter of fibers within a given 
variety of cotton is fairly consistent, the micronaire index will more likely indicate 
maturity variation rather than variations in fineness. A micronaire reading of 2.6-2.9 
is low while 3.0-3.4 is below average, 3.5-4.9 is average and 5.0 and up are high. For 

30 most textile applications a micronaire of 3.5-4.9 is used. Anything higher than this is 
usually not desirable. It will be appreciated though, that different applications require 
different fiber properties. Thus, it is understood that a fiber property that is 
disadvantageous in one application might be advantageous in another. 
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As is illustrated in the Examples section, which follows, biomolecular 
sequences of the present invention are capable of increasing trichome/leaf hair 
number and length, as well as seed hair. As such biomolecular sequences of the 
present invention can be used to generate transgenic plants with increased trichome 

5 number/length which better deter herbivores, guide the path of pollinators, or affect 
photosynthesis, leaf temperature, or water loss through increased light reflectance. 
Additionally such transgenic plants may be used for the compartmentalized 
production of recombinant proteins and chemicals in trichomes, as described in 
details in WO 2004/1 1 1 183 to Evogene Ltd 

10 Interestingly and unexpectedly, the present inventors found that 

polynucleotide sequences of the present invention are capable of increasing a biomass 
of a plant. It will be appreciated that the ability of the polypeptides of the present 
invention to increase plant yield/biomass/vigor is inherent to their ability to promote 
the increase in plant cell-size or volume (as described herein). 

15 Thus, the present invention also envisages a method of increasing a 

biomass/vigor/yield of a plant (coniferous plants, moss, algae, monocot or dicot, as 
well as other plants listed in www.nationmaster.com/encyclopedia/Plantae). This is 
effected by regulating expression and/or activity of at least one of the polynucleotides 
of the present invention, as described above. 

As used herein the phrase "plant biomass" refers to the amount or quantity of 
tissue produced from the plant in a growing season, which could also determine or 
affect the plant yield or the yield per growing area. 

As used herein the phrase "plant vigor" refers to the amount or quantity of 
tissue produced from the plant in a given time. Hence increase vigor could determine 
o or affect the plant yield or the yield per growing time or growing area. 

As used herein the phrase "plant yield" refers to the amount or quantity of 
tissue produced and harvested as the plant produced product Hence increase yield 
could affect the economic benefit one can obtain from the plant in a certain growing 
are and/or growing time. 

20 Thus, the present invention is of high agricultural value for promoting the 

yield of commercially desired crops (e.g., biomass of vegetative organ such as poplar 
wood, or reproductive organ such as number of seeds or seed biomass). 
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As used herein the term "about" refers to ± 10 %. 

Additional objects, advantages, and novel features of the present invention will 
become apparent to one ordinarily skilled in the art upon examination of the following 
5 examples, which are not intended to be limiting. Additionally, each of the various 
embodiments and aspects of the present invention as delineated hereinabove and as 
claimed in the claims section below finds experimental support in the following 
examples. 

10 

EXAMPLES 

Reference is now made to the following examples, which together with the 
above descriptions, illustrate the invention in a non limiting fashion. 

Generally, the nomenclature used herein and the laboratory procedures utilized 

15 in the present invention include molecular, biochemical, microbiological and 
recombinant DNA techniques. Such techniques are thoroughly explained in the 
literature. See, for example, "Molecular Cloning: A laboratory Manual" Sambrook et 
al., (1989); "Current Protocols in Molecular Biology" Volumes I-III Ausubel, R. M., 
ecL (1994); Ausubel et al., "Current Protocols in Molecular Biology", John Wiley and 

20 Sons, Baltimore, Maryland (1989); Perbal, "A Practical Guide to Molecular Cloning", 
John Wiley & Sons, New York (1988); Watson et al., "Recombinant DNA", Scientific 
American Books, New York; Birren et al. (eds) "Genome Analysis: A Laboratory 
Manual Series", Vols. 1-4, Cold Spring Harbor Laboratory Press, New York (1998); 
methodologies as set forth in U.S. Pat Nos. 4,666,828; 4,683,202; 4,801,531; 

25 5,192,659 and 5,272,057; "Cell Biology: A Laboratory Handbook", Volumes I-HI 
Cellis, J. E., ed. (1994); "Current Protocols in Immunology" Volumes I-HI Coligan J. 
E., ed. (1994); Stites et al. (eds), "Basic and Clinical Immunology" (8th Edition), 
Appleton & Lange, Norwalk, CT (1994); Mishell and Shiigi (eds), "Selected Methods 
in Cellular Immunology", W. H. Freeman and Co., New York (1980); available 

30 immunoassays are extensively described in the patent and scientific literature, see, for 
example, U.S. Pat. Nos. 3,791,932; 3,839,153; 3,850,752; 3,850,578; 3,853,987; 
3,867,517; 3,879,262; 3,901,654; 3,935,074; 3,984,533; 3,996,345; 4,034,074; 
4,098,876; 4,879,219; 5,011,771 and 5,281,521; "Oligonucleotide Synthesis" Gait, M. 
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J., ed. (1984); "Nucleic Acid Hybridization" Hames, B. D., and Higgins S. J., eds. 
(1985); "Transcription and Translation" Hames, B. D., and Higgins S. J., Eds. (1984); 
"Animal Cell Culture" Freshney, R. L, ed. (1986); "Immobilized Cells and Enzymes" 
IRL Press, (1986); "A Practical Guide to Molecular Cloning" Perbal, B., (1984) and 
"Methods in Enzymology" Vol. 1-317, Academic Press; "PCR Protocols: A Guide To 
Methods And Applications", Academic Press, San Diego, CA (1990); Marshak et al., 
"Strategies for Protein Purification and Characterization - A Laboratory Course 
Manual" CSHL Press (1996); all of which are incorporated by reference as if fully set 
forth herein. Other general references are provided throughout this document. The 
procedures therein are believed to be well known in the art and are provided for the 
convenience of the reader. All the information contained therein is incorporated 
herein by reference. 

EXAMPLE 1 

In silico identification of cotton genes involved in fiber formation 
Experimental Procedures 

Interspecies comparison of expressed sequences- Two main tools were used 
during the data mining stage. Large numbers of gene profiles were queried from an 
ORACLE database housing Compugen' s GeneCarta platform (Compugen Ltd. Israel). 
This data was loaded into Microsoft Excel spreadsheets for further manual 
refinement. Using this data a cross species genomic comparison was effected, aiming 
at defining organs from other plant species for which publically available EST 
libraries can be used both as models and as new sources of information to define new 
genes with key role in fiber formation (Figure 1). This comparison analysis used 
mainly the cotton, arabidopsis and tomato databases. 

Clustering and interspecies clustering of EST sequences - The cotton 
genomic database included less than 50,000 ESTs (Genbank release #135) originating 
primarily from two species Gossypium arboreum (~ 35,000 ESTs) and Gossypium 
hirsutum L. (~ 9,000 ESTs, Table 1, below). These ESTs were clustered and 
assembled using the LEADS™ software platform (Compugen Ltd, Israel) in two 
alternative approaches. 

In the first approach, the ESTs from two species were clustered and assembled 
together (thereby mimicking the evolutionary process since G. arboreum is an 
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ancestor of G. hirsutum). This process revealed 6478 clusters among them 3243 new 
clusters (without mRNA in the public database) that were defined as high quality 
clusters (Table 1, below). 

In the second approach, ESTs from each species were clustered and assembled 

5 separately. Comparison between the two approaches showed that using the first 
approach adds valuable information to the cotton clusters without a significant bias in 
the analysis. The tomato genomic database contains 126,156 ESTs originating from 
about 30 well defined libraries that through the clustering and assembling process 
revealed 14034 clusters of which a large group of 12787 new high quality clusters 

10 (Table 1). The genomic data of arabidopsis includes 99417 ESTs 
(ftp://ftp.ncbi.nih.gov/genbank/), 8573 full length cDNA (Rikken and genbank 
mRNAs ftp://ftp.ncbi.nih.gov/genbank/) and the entire DNA sequence. Using the 
LEADS software 23,148 clusters and 6777 singeltones (Single ESTs which no other 
EST was clustered therewith) were revealed, all of which were supported by ESTs 

15 sequences, contrary to the public consortium (TAIR, www.arabidopsis.org/). 

EST libraries from other plants and organs that share similar biological 
processes as cotton fiber were sought. Such ESTs are expected to serve as models 
and as new information sources for the identification of genes which are involved in 
the fiber development. To this end, a list of known genes that are suspected to be 

20 involved in fiber formation was generated. These genes originated from arabidopsis 
and were shown in various studies to have a key role in trichome formation (i.e., GL2, 
CPC, bHLH, TTG1, GL1, reviewed in Larkin J.C. et.al. 2003, Schellmann S. et aL 
2002). Extensive comparative genomic analysis revealed that tomato genes, with high 
homology to cotton fiber genes and to arabidopsis trichome genes have a significant 

25 EST content in either leaf trichome and specific flower development libraries. Further 
analysis compared the genomic data of these three species — cotton, Arabidopsis and 
tomato (focusing on the tomato libraries mentioned above) as key parameters in the 
present database search (Figure 1), 

30 Table 1 



Genomic databases of Cotton, Tomato and Arabidopsis 



Species 


EST Lib 
description 


EST count 


mRNA 


After LEADS 
(clusters^) 


G. arborewn 


Fiber 6DPA 


37,276 


12 


16,294 clusters 
on mixed 


G. hirsutum 


Fiber 7-10 DPA 


7,944 


236 
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G. hirsutum 


Flower ovule 

1 TXJD A 


1,272 


870 


production* 


L. esculentum 


All libraries 


115,859 


7 


25,678 clusters 
on mixed 
production 


L. hirsutum 


Trichome 
libraries 


2,409 


7 


L. pennellii 


Trichome 
libraries 


2,723 


24,450 


A. thaliana 


All libraries 


160,698 


mRNA 


25,678 clusters 



♦clusters derived from different species, cotton G. arboreum and G. hirsutum, tomato L. esculentum, L. 
hirsutum and L. pennellii 



In silico identification of cotton genes with a role in fiber development To 
5 find whether tomato genomic data can be used as a relevant source of genomic data to 
study cotton fiber development an extensive genomic comparison was effected to 
identify both tomato and cotton genes that have high homology to key genes 
detemiining arabidopsis trichome development (e.g., GL2, CPC, bHLH, TTG1, GL1). 

Homologous genes were identified in cotton and tomato. Because almost all 
10 cotton ESTs were produced from cotton fibers, it was impossible to do in-silico 
prediction of the expression profile of those genes. However, wide tissue sources used 
for the production of the tomato EST database enabled identification of tissues in 
which trichome specific genes are expressed. 

In tomato it was revealed that both trichome and ovule ESTs are enriched in 
15 clusters representing trichome specific genes. Interestingly, it was found that cotton 
fibers are produced from ovule coat ceils. As tomato seeds are covered with hairy like 
tissue, similarly to cotton seeds, it was postulated that those hairs are developmentally 
linked to trichome and cotton fiber formation. 

In tomato —1100 clusters were found to include at least one EST from 
20 trichome libraries. Among them about 1000 sequences included sequences also 
originating from tomato flower libraries (in which the ovule tissue is present). 
Comparing this group of genes to cotton data revealed —2300 cotton genes with high 
homology to the tomato trichome genes. Mining the database using these two groups 
of genes together with other bioinformatic information [cross species homology, Gene 
25 Onthology (GO)] revealed 80 cotton clusters predicted to have a key role in fiber 
formation. Those genes were selected based on the following criteria: 

Cotton clusters with at least 2 ESTs; 

Homology to tomato cluster with e-score higher than le-5; 
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Homology to tomato cluster with at least one EST coming from trichome 
libraries or one EST coming from ovule containing tissues; 

The following criteria were considered as advantageous although not 
necessary: 

5 Large number of ESTs in a cluster; 

Transcription factor/ signal transduction proteins; 

Gene annotation related to cell expansion, turgor pressure, cell-wall synthesis. 
The new genes together with the control cotton genes known to be involved in 
fiber formation were further analysed for their RNA expression profile in cotton 
10 plants. 



EXAMPLE 2 

tnRNA expression analysis of genes identified according to the teachings of the 

present invention 

To study the RNA expression profile of candidate genes identified as described in 
Example 1 above, a reverse transcription was effected followed by real time PCR 
(RT-qPCR). 

15 Experimental Procedures 

Quantitative Real time PCR analysis (qRT PCR) - To verify the levels of 
expression specificity and trait-association, Reverse Transcription following 
quantitative (Real-Time) PCR (RTqPCR) was effected. Total RNA was extracted at 
different stages of fiber development (from the day of anthesis till day 20 - post 

20 anthesis). To study the specificity of expression, RNA from other tissues of the cotton 
plants were collected and analysed for control expression (i.e., young leaves, young 
stems, mature stems, young roots, sepals, petals, and stamen). For this purpose, RNA 
was extracted from Cotton tissue using Hot Borate RNA Extraction protocol 
according to www.eeob.iastate.edu/faculty/Wen^ Reverse 

25 transcription was effected using 1.5 jag total RNA, using 300 U Super Script II 
Reverse Transcriptase enzyme (Invitrogen), 225ng random deoxynucleotide hexamers 
(Invitrogen), 500 pM dNTPs mix (Takara, Japan), 0.2 volume of x 5 RT buffer 
(Invitrogen), 0.01M DTT, 60U RNAsin (Promega), DEPC treated double distilled 
water was added up to 37.5 pi. RT reactions were incubated for 50 min at 42 °C, 
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followed by 70 °C for 15 min. cDNA was diluted 1:20 in Tris EDTA, pH=8. 5mL of 
the diluted cDNA was used for qRT-PCR. 

Quantitative RT-PCR was performed on cDNA (5 |xL), using xl SYBR 
GREEN PCR master mix (Applied Biosystems), forward and reverse primers 0.3 (iM 
5 each. The ABI7000real-time PCR machine was used with the following conditions 50 
°C for 2 min, 95 °C for 10 min, 40 times of 95 °C for 15 sec and 1 min at 60 °C, 
followed by 95 °C for 15 sec, 60 °C for 60 sec, and 70 times of 60 °C for 10 sec +0.5 
°C increase in each cycle. For each gene, a standard curve was prepared from a pool 
of RTs from all samples, in 5 dilutions (dilutions - 1:60, 1:200, 1:600, 1:2000, 

10 1 : 10000). The standard curve plot [ct (cycle threshold) vs. log (concentration)] should 
have R>=0.98 with an efficiency in the range of 100% +/- 5%. The levels of 
expression (Qty) measured in the qPCR were calculated using the efficiency (E) of 
the amplification reaction and the corresponding CT. (the cycle at which the samples 
crossed the threshold) Qty=E-C.T.. The dissociation curves obtained were inspected 

15 for the absence of unwanted additional PCR products or primer-dimers. Reactions 
were repeated at least twice. The calculation method is based in the fact that the 
efficiencies of the reactions of the GOI (gene of interest) and of the housekeeping 
genes are similar. 

To normalize the expression level between the different tissues, specific 
20 primers were designed for specifically hybridizing with the following housekeeping 
genes: Actin (GenBank Accession No. D88414 SEQ ID NO: 28, Forward and reverse 
primers are set forth in SEQ ID NO: 68 and 69, respectively), GAPDH (GenBank 
Accession No. COTCWPPR, partial sequence, SEQ ID NO: 29, Forward and reverse 
primers are set forth in SEQ ID NO: 97 and 98, respectively), and RPL19 (GenBank 
25 Accession No. AI729179, SEQ ID NO: 30, Forward and reverse primers are set forth 
in SEQ ID NO: 99 and 100, respectively). 

Using this methodology it was possible to identify genes that show elevated 
expression during fiber elongation, as well as genes that show unique cotton fiber 
specificity. Genes that showed elevated expression during anthesis that decreases 
30 during fiber elongation were considered good candidates to be involved in fiber 
differentiation and initiation. Notably, the above-described quantification 
methodology did not provide absolute expression levels, but provided good 
parameters for scoring the relative gene expression along fiber development as 
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differences as high as over 1000 fold in the maximal levels of expression reached by 
different genes were detected (Table 2, below). 
Results 

88 cotton genes were evaluated for expression profile in different tissues of 
cotton (Gossypium hirsutum, var Acala). According to the gene expression results, 23 
genes were predicted to improve fiber yield and quality. Expression profile of all the 
candidate genes are presented in Table 2. 
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young 
stems 


0.202 


0.025 


0.063 


0.003 


0.044 


0.049 


0.012 j 


000*0 


0.067 


0.762 


0.000 


0.007 


0.004 


0.047 


1.294 


0.480 


0.464 


2.759 


0.000 


0.003 


1.089 


0.000 


0.035 


0.007 


young 
roots 


0.002 


0.068 


0.037 


0.003 


0.037 


0.000 


0.000 


0.000 


0.069 


1.308 




0.001 


0.001 


0.005 


1.177 


0.004 


0.003 


0.872 


0.000 


0.001 


0.004 




0.071 


0.004 


young 
leaves 


0.347 


0.021 


0.142 


0.001 


0.148 


0.055 


0.008 


0.000 


0.037 


0.463 




0.001 


0.005 


0.060 
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Two main criteria were used to select cotton genes as candidates that may be 
involved in fiber development according to their RNA profiling. Genes showing a 
high degree of fiber expression specificity and genes displaying expression level, 
which changes concomitantly with fiber development (Table 3, below). 

5 Twenty three genes met these selection criteria: 

CT-1 (SEQ ID NOs. 1 and 106), CT_2 (SEQ ID NOs.2 and 107), CT_3 (SEQ 
ID NOs. 3 and 108), CT_4 (SEQ ID NOs. 4 and 109) CT_6 (SEQ ID NOs. 5 and 
110), CT_7 (SEQ ID NOs. 6 and 1 1 1), CT_9 (SEQ ID NOs. 7 and 1 12), CT_1 1 (SEQ 
ID NOs. 8 and 113), CT_20 (SEQ ID NOs. 9 and 114), CT_22 (10 and 115), CT_26 

10 (SEQ ID NOs. 11 and 116), CT_27 (SEQ ID NOs. 12 and 117), CT_40 (SEQ ID 
NOs. 16 and 118), CT_49 (SEQ ID NOs. 17 and 119), CT_70 (SEQ ID NOs. 18 and 
120), CT71 (SEQ ID NOs. 19 and 121), CT_74 (SEQ ID NOs.20 and 122), CT_75 
(SEQ ID NOs. 21 and 123), CT76 (SEQ ID NOs. 22 and 124), CT77 (SEQ ID 
NOs. 23 and 125), CT_81 (SEQ ID NOs. 24 and 126), CT_82 (SEQ ID NOs. 25 and 

15 95), CT_84 (SEQ ID NOs. 27 and 96) and CT_88 (SEQ ID NOs. 13 and 26). 

CT-4, 22, 20, 27, 40, 82 (SEQ ID NOs: 4, 10, 9, 12, 16 and 25, respectively) 
were chosen mainly as candidate genes that may have a role in fiber initiation (Table 
3) while CT 27 (SEQ ID NO: 12), which is a homologue gene to GL3, was also used 
as a control (in Figure 2d CT 22, SEQ ID NO: 10 is shown). 

20 CT-1, 2, 3, 6, 7, 9, 49, 70, 71, 74, 75, 76, 77, 81, 84 (SEQ ID NOs. 1, 2, 3, 5, 6, 

7, 17, 18, 19, 20, 21, 22, 23, 24 and 27, respectively, see Figures 2a, c) were predicted 
to be involved in the fiber elongation and quality (strength and finesse) according to 
their expression pattern (Table 3, Figure 2C CT 1 is shown). 

CTll, 40, 74 and CT 26 (SEQ ID NOs. 8, 16, 20 and 11, respectively, see 

25 Figures 2a, b) which are homologous to Glabrousl from Arabidopsis (GenBank 
Accession No. AB006078) are fiber specific genes that showed uniform and fiber- 
specific expression during all stages of fiber development (Table 3, in Figure 2B CT 
1 1 is showri as an example). Expression profile of all the chosen genes are shown in 
Table 2, above. 
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The selected genes were over-expressed in transgenic arabidopsis and tomato, using 
the constitutive CaMV promoter of 35S (SEQ ID NO. 31). Transgenic plants were 
further evaluated for epidermal modifications, trichome density and length and seed 
hair yield (as further described hereinbelow). 

5 

EXAMPLE 3 

Analysis of gene expression using publically available microarrays 
Further information about the expression of the selected genes (Example 2, 
above) was retrieved by statistical analysis of microarray data from arabidopsis. 
10 Essentially, the best homologs of the new candidate genes in arabidopsis were 
compared to a set of 77 microarrays experiment of different tissues of Arabidopsis 
(AtGenExpress databases, the Principal investigator for AFGN: Prof. Dr. Lutz Nover, 
Botanisches Institut, Molekulare Zellbiologie, FB Biologie und Informatik der J. W. 
Goethe Universitat Frankfurt; Biozentrum N200 30G, Marie-Curie-Strasse 9, 60439 
15 Frankfurt am Main, www.arabidopsis.or^info/expression/ATGenExpress.isp) . 

Polynucleotide sequences that were highly expressed in elongated cells or 
inflorescence meristems were selected for further analysis. 

Table 4 below lists tissues which exhibit the highest levels of gene expression. 



Table 4 


f 




Tissues with high expression 


< Fold change/ 
specificity 


Related to fiber 


CT_1 


Seed, siliques 


10-20 i 


Elongated cells 


CT_11 


carpels, flower, seed, siliques 


Tissue specific 


Flower specific 


CTJ2 


root, seedlin and sepals 


Tissue specific 


Elongated cells, 


CT_22 


carpels, flower, inflorescence, shoot 


4-10 


inflorescence 


CT_4 


Petals, stamen 


>10 


Elongated cells, 


CT49 


siliques 


>2 


Elongated cells, 


CT_7 


carpels, flower, inflorescence, petals, shoot, 
siliques, 


10-30 


inflorescence 


CT_70 


flower, root, stamen 


Almost tissue 
specific 




CT76 


carpels, flower, inflorescence, shoot, 
siliques 


>2 


Elongated cells, & 
inflorescence 


CT77 


seeds, pollen, stemen, petals, sepals, siliques 


10-50 


Elongated cells 


CT_82 


inflorescence, shoot stem 


3-6 


inflorescence 


CT_88 


petals, stamen 




Elongated cells 
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EXAMPLE 4 

Establishing a correlation between expression of candidate genes and fiber length 
In order to define correlations between the levels of RNA expression of the 
5 selected genes and fiber length, fibers from 4 different cotton lines were analyzed. 
These fibers were selected showing very good fiber quality and high lint index (Pima 
types, originating from other cotton species, namely G. barbadense) and different 
levels of quality and lint indexes from various G. hirsutum lines: good quality and 
high lint index (Acala type), medium lint index (Coker type) and poor quality and 
10 short lint index (Tamcot type). 

Experimental procedures 

RNA extraction - Fiber development stages, representing different fiber 
characteristic, at 5, 10 15 and 20 DPA were sampled and RNA was extracted as 
describe in Example 2. 

15 Fiber assessment - Fiber length of the above lines was measured using 

fibrograph. The fibrograph system was used to compute length in terms of "Upper 
Half Mean" length. The upper half mean (UHM) is the average length of longer half 
of the fiber distribution. The fibrograph measures length in span lengths at a given 
percentage point (www.cottoninc.conVClassificationofCotton/?Pg^=4#Length . > ) 

20 Results 

Four different cotton lines were grown in Rehovot, Israel during summer 2004, 
and their fiber length was measured. The fibers UHM values are summarized in Table 
5, below: 

25 TableS 





Length (UHM) 


PimaS5 


1.40±0a 


Acala 


1.23 ± 0.01b 


Coker 3 10 


1.18 ± 0.01c 


Tamcot 


1. 15 ± 0.02 c. 



Five genes were tested for correlation between gene expression and fiber 
length (presented for CTJ76 in Figure 3). The results are summarized in the Table 6 
below: 
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Table 6 





Tissue Sampling Day (DPA) 


n 
u 


5 


10 


15 


Relative 
amounts 
ofmRNA 


■3 rtlft 4**m -m ra 

Jveiaiive 
amounts 
of 

mRNA 


Relative 
expression 
Related to 
TO 


Keiauve 
amounts 
of 

mRNA 


Relative 
expression 
Related to 
TO 


iveiaiive 
amounts 
of 

mRNA 


Relative 
expression 
Related to 
TO 


CT_1 


Tamcot 


0.75 


2.99 


4.0 


4.71 








Coker310 


0.51 


4.80 


9.3 


7.56 








Acala 


0.64 


5.08 


7.9 


8.01 








CT_2 


Tamcot 


0.03 


0.19 


7.6 


8.17 








Coker 310 


0.03 


0.35 


11.4 


15.04 








Acala 


0.02 


0.36 


17.7 


15.28 








Pima S5 


0.02 


0.41 


23.6 


17.58 








CT.40 


Tamcot 


0.28 










0.47 


1.67 


Coker 310 


0.37 










0.46 


1.24 


Acala 


0.30 










0.67 


2.25 


Pima S5 


0.37 










1.03 


2.75 


CT.76 


Tamcot 


0.01 


0.03 


5.4 


0.01 


2.3 


0.00 


0.10 


Coker 310 


0.01 


0.08 


8.9 


0.04 


5.1 


0.00 


0.10 


Acala 


0.01 


0.12 


16.6 


0.06 


9.1 


0.00 


0.12 


Pima S5 


0.01 


0.13 


122.4 


0.18 


177.9 


0.12 


99.51 


CT.81 


Tamcot 


0.50 


1.33 


2.68 


5.03 


10.15 


1.11 


2.24 


Coker 310 


0.31 


2.64 


8.65 


4.51 


14.76 


0.84 


2.75 


Acala 


0.49 


4.38 


8.98 


6.36 


13.05 


3.65 


7.49 



Reverse-transcription following quantitative PCR was performed using real-time PCR, on tissues of 0, 
5 10 and 15 DPA of cotton (G. hirsutum var Tamcot, Coker and Acala, and G. barbadense var Pima 
S5) plants. Relative amounts of mRNA and Relative expression related to TO of each gene are 
presented in all examined tissues. 



EXAMPLES 

Cloning of the selected genes in a binary vector under constitutive regulation and 

recombinant expression of the same 
ORF analysis - Gene sequences of the present invention were analyzed for ORFs 
using Gene Runner software version 3.05 (Hasting Software, Inc: 
www.generutiner.com/) . ORFs of each gene were compared to Genbank database, 
using Blast (www.ncb i -nlm.nih. gov/BLAST/) . By comparing to highest homologous 
ORFs, the position of the ATG initiation codon was determined. All the sequences 
described herein were shown to have a predicted full length ORF and to include the 
predicted ATG starting codon. 

Cloning into the pPI expression vector - For cloning genes of the present 
invention, total RNAs from the various developmental stages of fiber producing cells 
was extracted, using Hot Borate RNA Extraction from Cotton Tissue according to 
www.eeob.iastate.edu/facultv/Wen^ Complementary DNA 

(cDNA) molecules were produced from mRNA using M-MuLV reverse-transcriptase 
(RT) enzyme (Roche) and TiaNN DNA primer, following protocol provided by the 
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manufacturer. cDNA amplification was done for 19 genes, out of the sequences 
above, namely CT clones number 1, 2, 3, 6, 7, 9, 1 1, 20, 22, 27, 40, 71, 74, 75, 76, 81, 
82, 84 and 88, by PCR using PFU proof reading DNA polymerase enzyme (Promega 
www.promegaxom/pnotes/68/73 8 1_07/73 8 l_07.html) following the protocol 
provided by the manufacturer. Primers for each gene were designed to span the full 
ORF. Additional restriction endonuclease sites were added to the 5' end of each 
primer to facilitate further cloning of the CTs to the binary vector (pPI). Table 7 
below, lists the primers used for cloning each of the genes: 
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The resultant PCR blunt ended products, were purified using PCR Purification 
Kit (Qiagen, Germany), digested with the appropriate restriction endonucleases 
(Roche) and cloned into the pPI binary vector (Figure 4), while replacing the existing 
GUS reporter gene. pPI is a modified version of pBI10L3 (Clontech, Accession No. 
5 U 12640). pPI was constructed by inserting a synthetic poly-(A) signal sequence, 
which originated from pGL3 Basic plasmid vector (Promega, Acc No U47295, where 
the synthetic poly-(A) signal sequence is located between base-pairs 4658-4811), into 
the HindTTT restriction site of pBI101.3 (while reconstituting the Hindin site, 
downstream to the poly-(A) insert), to avoid the possibility of read-through effect of 

10 the upstream Nos-promoter. To replace the GUS gene with each one of the CT genes 
in the pPI binary vector, pPI was digested with the appropriate restriction en2ymes [5* 
prime restriction enzyme is either Smal or Xbal and 3' prime restriction enzyme is 
either Sad or EcoRV (Roche- using the protocol provided by the manufacturer)]. 
Open binary vector was purified using PCR Purification Kit (Qiagen, Germany). 5-75 

15 ng of PCR product of each of the CT genes and 100 ng of open pPI plasmid vector 
were ligated in 10 ^L ligation reaction volume using T4 DNA ligase enzyme (Roche), 
following the protocol provided by the manufacturer. Ligation products were 
introduced into E, coli cells. 

Recombinant expression in bacteria - 60 pL of E. coli, strain DH5-a 

20 competent cells (about 10 9 cells/mL) were transformed using ljal of ligation reaction 
mixture by electroporation, using a MicroPulser electroporator (Biorad), 0.2 cm 
cuvettes (Biorad) and EC-2 electroporation program (Biorad). E. coli cells were 
grown on 0.8 mL LB liquid medium at 37 °C for 1 hrs and 0.2 mL of the cell 
suspension were plated on LB-agar plates supplemented with the antibiotics 

25 kanamycin 50 mg/L (Sigma). Plates were then incubated at 37 °C for 16 hrs. 
Bacteria colonies were grown and expression was confirmed by PCR amplification 
using primers which were designed to span the inserted sequence in the binary vector. 
Primers used for DNA amplification of the inserts in the pPI binary vector were: 
5-GGTGGCTCCTACAAATGCC ATC-3 ' (forward, SEQ ID NO. 70) and 5'- 

30 AAGTTGGGTAACGCC AGGGT-3 ' (reverse, SEQ ID NO. 71). 

PCR products were separated on 1.5 % agarose gels and product sizes were 
estimated by comparing to DNA ladder (MBI Fermentas). PCR products with the 
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predicted size were sequenced using the same primers previously used for PCR 
amplification (See Table 7, above). 

Additional primers, which were designed based on the sequence of each gene insert, 
were used to complete the sequencing of the full length ORF insert. 
5 Sequencing of the inserted sequence was performed to verify that the clones were 
introduced in the right orientation, and to eliminate the possibility that sequence errors 
were included during PCR amplification. DNA sequences were determined using ABI 
377 sequencer (Amersham Biosciences Inc). 

Into each one of the 19 pPI binary constructs harboring the CT genes, the 
10 constitutive, Cauliflower Mosaic Virus 35S promoter was cloned. 

Cauliflower Mosaic Virus 35S promoter sequence, originated from the pBI121 
vector (Clontech, Accession No AF485783) was cloned by digesting the pBI121 
vector with the restriction endonucleases Hindlll and BamHI (Roche) and ligated into 
the binary constructs, digested with the same enzymes (SEQ ID NO. 3 1). 

15 

EXAMPLE 6 

Agrobacterium transformation of binary plasmids harboring the genes of interest 
and expression in Arabidopsis and tomato plants 
Each of the nineteen binary constructs, comprising the 35S promoter upstream 
20 of each of the CTs genes was transformed into Arabidopsis or tomato plants via 
Agrobacterium tumefacience transformation. 

60 pL of Agrobacterium tumefaciens GV301 or LB4404 competent cells 
(about 10 9 cells/mL) were transformed with 20 ng of binary plasmid via 
electroporation, using a MicroPulser electroporator (Biorad), 0.2 cm cuvettes (Biorad) 
25 and EC-2 electroporation program (Biorad). 

Agrobacterium cells were grown on 0.8 mL LB liquid medium at 28 °C for 3 
hrs and 0.2 mL of the cell suspension were plated on LB-agar plates supplemented 
with the antibiotics gentamycin 50 mg/L (for Agrobacterium strains GV301) or 
streptomycin 300 mg/L (for Agrobacterium strain LB4404) and kanamycin 50 mg/L 
30 (Sigma). Plates were then incubated at 28 °C for 48 hrs. Agrobacterium colonies 
were grown and PCR amplification was performed on Agrobacterium cells, using 
primers which were designed to span the inserted sequence in the binary vector. 
Primers used for PCR amplification were: 5- 
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GGTGGCTCCTAC AAATGCC ATC-3 5 (forward, SEQ ID NO. 70) and 5'- 
AAGTTGGGTAACGCC AGGGT-3 ' (reverse, SEQ ID NO. 71). 

PCR products were separated on 1.5 % agarose gels and product sizes were 
determined by comparing to DNA ladder (MBI Fermentas). PCR products with the 
5 predicted size were sequenced using the primers which were used for the PCR 
amplification. Sequencing of the inserted sequence was performed to verify that the 
right clones were introduced into the Agrobacterium cells. 

DNA sequencing was effected using ABI 377 sequencer (Amersham Biosciences 
Inc.). 

1 o Plant transformation and cultivation: 

Transformation of Arabidopsis thaliana plants with putative cotton genes - 
Arabidopsis thaliana Columbia plants (TO plants) were transformed using the Floral 
Dip procedure described by Clough and Bent and by Desfeux et al., with minor 
modifications. Briefly, TO Plants were sown in 250 ml pots filled with wet peat-based 

15 growth mix. The pots were covered with aluminum foil and a plastic dome, kept at 4 
°C for 3-4 days, then uncovered and incubated in a growth chamber at 18-24 °C 
under 16/8 hr light/dark cycles. The TO plants were ready for transformation six days 
prior to anthesis. Single colonies of Agrobacterium carrying the binary constructs, 
were cultured in LB medium supplemented with kanamycin (50 mg/L) and 

20 gentamycin (50 mg/L). The cultures were incubated at 28 °C for 48 hrs under 
vigorous shaking and then centrifuged at 4,000 rpm for 5 minutes. The pellets 
comprising Agrobacterium cells were re-suspended in a transformation medium 
containing half-strength (2.15 g/L) Murashig-Skoog (Duchefa); 0.044 \iM 
benzylamino purine (Sigma); 112 jxg/L B5 Gambourg vitamins (Sigma); 5 % sucrose; 

25 and 0.2 ml/L Silwet L-77 (OSI Specialists, CT) in double-distilled water, at pH of 5.7. 
Transformation of TO plants was effected by inverting each plant into an 
Agrobacterium suspension, such that the above ground plant tissue was submerged for 
3-5 seconds. Each inoculated TO plant was immediately placed in a plastic tray, then 
covered with clear plastic dome to maintain humidity and was kept in the dark at 

30 room temperature for 18 hrs, to facilitate infection and transformation. Transformed 
(i.e., transgenic) plants were then uncovered and transferred to a greenhouse for 
recovery and maturation. 
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The transgenic TO plants were grown in the greenhouse for 3-5 weeks until 
siliques were brown and dry. Seeds were harvested from plants and kept at room 
temperature until sowing. For generating Tl transgenic plants harboring the genes, 
seeds collected from transgenic TO plants were surface-sterilized by soaking in 70% 
5 ethanol for 1 minute, followed by soaking in 5% sodium hypochloride and 0.05% 
triton for 5 minutes. The surface-sterilized seeds were thoroughly washed in sterile 
distilled water then placed on culture plates containing half-strength Murashig-Skoog 
(Duchefa); 2 % sucrose; 0.8 % plant agar; 50 mM kanamycin; and 200 mM 
carbenicylin (Duchefa). The culture plates were incubated at 4 °C for 48 hours then 

10 transferred to a growth room at 25 °C for an additional week of incubation. Vital Tl 
Arabidopsis plants were transferred to a fresh culture plates for another week of 
incubation. Following incubation the Tl plants were removed from culture plates and 
planted in growth mix contained in 250 ml pots. The transgenic plants were allowed 
to grow in a greenhouse to maturity. 

15 Transformation of Micro-Tom tomato plants with putative cotton genes - 

Tomato (Lycopersicon esculentum, var MicroTom) transformation and cultivation of 
transgenic plants was effected according to Curtis et aL 1995, and Meissner et ah 
2000. 

2 0 EXAMPLE 7 

Growth of Arabidopsis transformed plants and phenotype characterizations 
Tl arabidopsis plants were grown as described above and phenotypes were 
characterized. 

PCR analysis of transgenic plants - Arabidopsis T2 seeds were sown directly 
25. in growth mix contained in 250 ml pots. Positive transgenic plants were screen for 
kanamycin resistance in two weeks old leaves by PCR. Primers used for PCR 
amplification of the kanamycin were: 5'- CTATTCGGCTATGACTGGGC -3' 
(forward, SEQ ID NO. 72) and 5'- ATGTCCTGATAGCGGTCCGC -3' (reverse, 
SEQIDNO.73). 

30 Root performance - In order to visualized root performance, T2 seeds were 

surface-sterilized by soaking in 70 % ethanol for 1 minute, followed by soaking in 5 
% sodium hypochloride and 0.05 % triton for 5 minutes. The surface-sterilized seeds 
were thoroughly washed in sterile distilled water and then placed in culture plates 
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containing half-strength Murashig-Skoog (Duchefa); 2% sucrose; 0.8% plant agar; 50 
mM kanamycin; and 200 mM carbenicylin (Duchefa). The culture plates were 
incubated at 4 °C for 48 hours then transferred to a growth room at 25 °C till reaching 
the right size for phenotypic characterization. 
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EXAMPLE 8 

Growth of MicroTom transformed plants and phenotype characterizations 
Experimental Procedures 

Transgenic tomato plants - Plant were transformed as described in Example 
6, above. Following transformation, Tl MicroTom tomato plants were grown in mix 
contained in 1000 ml pots. 
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Discussion 
(Examples 1-8) 

In-silico identification of genes involved in cotton fiber development - Little 
is known about the genetic control of cotton fiber initiation and elongation. Since 
5 both cotton fiber and Arabidopsis trichomes are developed from single epidermal 
cells they are assumed to share similar genetic regulation (Reviewed at Wagner GJ. 
et ah 2004). In Arabidopsis, a large number of studies have revealed extensive 
information on the genetic mechanisms regulating trichome initiation and elongation. 
Several studies demonstrated the similarities between trichome and fiber by showing 

10 that cotton fiber specific promoters in arabidopsis and tobacco plants confer trichome 
specific expression (Kim and Triplett, 2001; Hsu et al. 1999; Liu et al. 2000, Wang 
et al. 2004). Most of the research that studies fiber development uses arabidopsis 
trichome as a model system to identify cotton genes in a small scale manner (Kim and 
Triplett, 2001; Wang et al. 2004). 

15 In this study the present inventors have used tomato trichome and flower EST 

libraries as model systems to study cotton fiber development. Analysis of the EST 
libraries profile of the tomato homologous clusters to known arabidopsis trichome 
genes showed that tomato trichome and flower EST libraries significantly contributed 
to this set of clusters. 

20 This result was confirmed while analyzing the EST libraries profile of the new 

cotton clusters that were selected by their RNA expression pattern as cotton fiber 
genes. 9 and 10 clusters contained ESTs which originated from the flower and 
trichome libraries respectively. Furthermore the group of tomato trichome clusters 
(trichome ESTs/total ESTs> 0.1) comprise large portion from the tomato genes that 

25 show high degree of homology, to cotton (~ 50 %) even though their percentage in the 
total population is only -5 %. It may indicate that both organ share common 
developmental processes. Even though there is a large group of studies about the 
genetic control of tomato fruit and trichome development no publications could be 
found to use these organs as a source of genomic data to study cotton fiber 

30 development. All of the 23 cotton genes were compared to unique EST data produced 
separately from embryo and suspensor of Scarlet Runner bean developing seeds 
(vmw.mcdb.ucla.edu/Research/Goldberg/ests/mtro-index.htmV All sequences, except 
one, share high homologies with sequences originated from the suspensor, which is a 
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maternal tissue. This result supports the in silico results and identifies the role of these 
cotton clusters in fiber development, which originated from maternal cells as well. 

Identifying cotton genes with a role in fiber development through analysis of 
RNA expression profile - The differentiation/initiation phase is represented by gene 

5 expression at or before anthesis. The elongation phase mainly in hirsutum cultivars is 
represented by very fast growth rate mainly during 5 to 20 DP A. One pattern is 
represented by genes such as CT 1, 2, 3 expressed at their highest levels, slightly 
before and during the period of peak fiber expansion about 20 DP A. Another pattern 
of gene expression is displayed by the CT40, 11 or 70 which have the same 

10 expression level throughout all fiber development. Likewise, known genes encoding 
actin, endoxyloglucan transferase or Sue synthase also display unvarying RNA levels 
throughout fiber development (Shimizu et al., 1997). 

Since the initiation occurs mainly before anthesis till 1 DPA it suggests that 
genes with a peak in expression during this time may have a role in fiber initiation. 

15 CT 4, 20, 22 and 11 have expression patterns that indicate their involvement at this 
stage. 

One limitation of the current cotton EST database is the absence of ESTs that 
were extracted from flower at initiation stage (there is one library that was taken from 
ovary 1 DPA but of poor quality) most ESTs were taken only later on, between 6 to 

20 10 DPA. This EST composition could explain why most of the chosen genes have 
expression pattern that indicate their association with the elongation stage. 

Role of the selected genes in fiber development, possible mechanisms - The 
23 fiber-associated clusters could be classified into 6 functional categories according 
to their sequence homology to known proteins and enzymes (Table 3, above). The 

25 classification was made according to the GO consortium (www.geneontology.org/). 
The largest group comprises unique sequences without homology to any known 
protein. The rest of the clusters were classified according to categories known to be 
associated with fiber development. Two genes (Table 3, above) were classified into a 
cell fate commitment category: a new gene that belongs to the MYB transcription 

30 factor and a cotton homologous gene to GL3 that are known to be involved in 
trichome development in arabidopsis. The expression pattern of both genes and the 
phenotype of CT20 transgene both in arabidopsis and tomato Tl plants support their 
involvement mainly in the initiation phase. 
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Accumulative evidence link cotton MYB genes with fiber development (Suo. 
J. et al 2003, Cerdoni. M.L. et al 2003, Loguerico L.L. et al 1999). Over 
expression of a number of genes that work in the same pathway related to the 
initiation phase, could further induce initiation. Kirik et al. (2004) showed that by 

5 over-expressing two or three genes from the initiation phase they enhance the number 
of trichome and root hairs. Genes that relate to the initiation phase could be used for 
uniformity of fiber initiation on the cotton seed, initiate of more of the seeds 
epidermis cells into fibers. Over expression of those genes in vegetative meristems 
such as stems and leaves could be used as protect against insects (as has been shown 

10 in canola, www.westerngrains.com/news/nr_050413.html) and a-biotic stresses. 
However, there is no substantial evidence that proves direct involvement of any MYB 
gene to fiber development. 

Two other genes (Table 3, above) are transcription factors from the MYB and 
MADS BOX families. Many studies demonstrated the function of these two 

15 transcription factor families as homeotic genes with key role in different 
developmental processes, among them are trichome and fiber morphogenesis (Suo. J. 
et al 2003, Ferrario S et al 2004). Their role in early stages of fiber development is 
supported also by their RNA expression pattern, which, is induced before, and during 
the day of anthesis. One gene (CT_2, Table 3, above) was classified to the pathways 

20 of starch and sucrose metabolism. A recent work demonstrates that another gene 
(SUS), which, belongs to this pathway, is a limiting factor in both fiber initiation and 
development. CT_40, 75 were classified as lipid transport whose RNA expression is 
highly induced during early fiber elongation stage fit to the fact that lipids are key 
components in fiber formation. Several genes (Table 3, above, CT_4, 70, 71) were 

25 classified either as genes involved in desiccation, salinity response stimulated by 
abscisic acid and genes involved in electron transfer. Out of them 3 genes (CT 7, 9 
and 49) were selected by RNA expression pattern to be induced in the elongation 
stage. Several studies consider changing proton and potassium pump mechanisms as 
key factor in the rapid growth rate of the fiber (Smart L.B, et al 1998). Combine the 

30 over-expression of several genes relate to fiber elongation such as genes relate to 
starch and sucrose metabolism that will enhance cell wall formation with lipid 
transport genes or genes relate to desiccation that my influence on the pressure in the 
cell, might result in longer fibers then over expressed of single gene. 



WO 2005/121364 



PCT/IL2005/000627 



57 

EXAMPLE 9 

Cloning and analyses of promoter sequences upstream of the genes of the present 

invention 

Differential gene expression in fiber tissues vs. other tissues in cotton is the 
result of complicated gene regulation. The genomic regions upstream of the 23 
selected genes are predicted to possess promoter activities that direct gene expression 
to fiber cells in unique quantitative and qualitative manner. A precise gene expression, 
directed to fiber cells, is crucial for the development of cotton plants with enhanced 
fiber performance, without negatively affecting other plant tissues. 

Experimental Procedures 

Cloning of promoter sequences - The genomic sequence upstream of CT2 and 
CT6 were cloned from genomic DNA of cotton {Gossypium hirsutum L. var Acala), 
as follows. Total genomic DNA was extracted from plant leaf tissues of 4 week old 
cultivated cotton plants {Gossypium hirsutum Z., var Acala), using DNA extraction kit 
(Dneasy plant mini kit, Qiagen, Germany). Inverse PCR (IPCR), DNA digestion, 
self-ligation, and PCR reaction were performed on genomic DNA, following common 
protocol (www.pmci.unimelb.edu.au/core facilities/manual/mb3 go.aspl with the 
following modifications. To avoid mistakes in the IPCR, the genomic sequence of the 
5' sequence of a relevant cDNA (i.e. including introns) was first identified to produce 
Genomic Island (GI). The desired region from the genomic DNA was PCR-amplified 
using direct oligonucleotide primers designed based on the cDNA cluster sequence 
(for CT_2 and CT_6, respectively GI sequences are as set forth in SEQ ID NOs. 74 
and 75 for CT_2 and CT_6. Primers are set forth in SEQ ID NOs. 14-15 (CT_2) and 
101-102 CT_6). PCR reaction was performed in a DNA thermal cycler, using 
common PCR protocols. For example: 

92 °C/3 min ^ 31 x [94 °C/30 sec 56 °C/30 sec -+ 72 °C/3 min] -> 72 °C/10 
min). 

PCR products were purified using PCR purification kit (Qiagen) and 
sequencing of the amplified PCR products was performed, using ABI 377 sequencer 
(Amersham Biosciences Inc). 

In some cases, a different technique [UP-PCR (Dominguez and Lopez-Larrea. 
1994)] was used when IPCR resulted in poor amplification. UP-PCR technique was 
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used in order to amplify unknown upstream region of known cluster sequences. 
Generally, the procedure involved four oligonucleotide primers: two sequence 
specific primers (SPs, external and internal) (listed below), both with same orientation 
of 3' end towards the unknown, yet desired, 5' region of the gene, and two universal 
5 walking primers (WP28 5'- Tlli rri - 1TTTG TTTGTTGTGGGGGTGT (SEQ ID 
NO. 76 and sWP 5'- TTTTTGTTTGTTGTGGG, SEQ ID NO. 77). Reactions were 
carried out using the following reaction mixtures: sample mixture (SM) - genomic 
DNA of cotton species (30-40ng), WP28 primers (20 pmol), and double distilled 
water was added to a final volume of 10 pi. Polymerase mixture (PM) — dNTPs 

10 (Roche, Switzerland, lOnmol each), Expand Long Template Enzyme mix (Roche, 
Switzerland, 1U), 10 x buffer supplied with the enzyme and double distilled water 
was added to a final volume of 8 pi. 

SMs were placed in a thermocycler (Biometra, USA), where it was subjected 
to an amplification program of 1 minute at 90 °C, held (pause) at 80 °C until PM 

15 was added, 30 seconds at 15 °C, 10 minutes at 25 °C, 3 minutes at 68 °C, held at 90 
°C until the external SP (2 pi of 10 pM concentration) was added. The process was 
followed by external PCR reaction of 30 seconds at 92 °C, 10 seconds at 94 °C, 30 
seconds at. 65.5° C, 3 minutes at 68 °C, for 30 cycles followed by final extension of 
10 minutes at 68 °C. 

20 External PCR product diluted 5000 - 25000 fold was used as a template, and 

PCR amplification was effected using specific internal sWP and SP (30 pmol each) 
primers, 1U Ex Taq (Takara), in 50pl reaction volume. Internal PCR reaction was 
subjected to an amplification program of 2 minutes at 92 °C, followed by 30 seconds 
at 94 °C, 30 seconds at 58 °C, and 3 minutes at 72 °C for 30 cycles and a final 

25 extension of 10 minutes at 72 °C. IPCR / Up-PCR products were purified (PCR 
Purification Kit, Qiagen, Germany) and sequenced (ABI 377 sequencer, Amersham 
Biosciences Inc). 

Primers for CT_2 were as follows (UP-PCR): 
External primers: 

30 sWP28- 5 5 - TTTTTTTTTTTC (SEQ ID NO. 78) 

SP (External)- 5'- CTGGGGTTACTTGCTAATGG -3' (SEQ ID NO: 79) 
Internal (Nested) primers: 

sWP- 5'- TTTTTGTTTGTTGTGGG -3' (SEQ ID NO: 80) 
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SP (Internal)- 5'- GCTCCGGGCTTTGGTTAACG -3' (SEQ ID NO: 81) 
Internal genomic sequence of CT_2 resulting from the above procedure is 
provided in SEQ ID NO: 14. 

Primers for CT_6 were as follows (UP-PCR): 
5 External primers: 

sWP28- 5'- TTTTTTTTTTTC ' (SEQ ID NO. 78 

SP (External)- 5'- GGCTTTGGGATGTTTGAGGTGG -3' (SEQ ID NO. 82) 
Internal (Nested) primers: 

sWP- 5'- TTTTTGTTTGTTGTGGG -3' (SEQ ID NO: 83) 
10 SP (Internal)- 5'- GGTGGTGGGCTCTTGCAACAG -3' (SEQ ID NO: 84) 

Internal genomic sequence of CT 2 resulting from the above procedure is 
provided in SEQ ID NO: 85. 

For cloning the putative promoters and 5' UTRs, PCR amplification was 
carried out using a new set of primers (below) to which 8-12 bp extension that 
15 included one restriction site (HindUI, Sail, Xbal, BamHL, or Smal) on the 5' prime 
end. For each promoter, restriction sites that do not exist in the promoter sequence 
were selected. Moreover, the restriction sites in the primer sequences were design so 
the resultant PCR products will be cloned into the binary vector pPI in the right 
orientation, upstream of the GUS reporter gene. 
20 The plasmid pPI was constructed by inserting a synthetic poly-(A) signal 

sequence, originating from pGL3 basic plasmid vector (Promega, Acc No U47295; bp 
4658-4811) into the HindUI restriction site of the binary vector pBI101.3 (Clontech, 
Accession No. Ul 2640). 

Below are the primers used for promoter and 5' UTR (P+U) amplification and 
25 cloning into pPI, and the amplified and cloned sequence. Restriction sites within each 
primer are shown in bold letters: 
CT_2: 

P+U forward (HindUI): 5'- ATTCAAGCTTTTTTTGTTTGTTGTGGGGG - 
3' (SEQ ID NO: 86) 

30 ( P+U reverse (BamHI): 5'- TTGGATCCTTGGGCATTGAGCTTCTGTAC - 

3' (SEQ ED NO: 87) 

P+U sequence of CT_2 is as set forth in SEQ ID NO: 88. 
CT6: 
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P+U forward (Hindlll): 5'- TTAAAGCTTTGGGCTCTTGCAACAGAGGC - 
3' (SEQ ID NO: 89) 

P+U reverse (BamHI): 5'- AAGGATCCGACGACGACAACAACAACAAC 
-3' (SEQ ID NO: 90) 
5 P+U sequence of CTjS is as set forth in SEQ ID NO: 9 1 . 

Genomic DNA or the IPCR/UP-PCR product was used as DNA template for 
PCR-amplification, using the newly designed oligonucleotide primers. PCR products 
were purified (PCR Purification Kit, Qiagen, Germany) and digested with the 
restriction sites exist in the primers (Roche, Switzerland). The digested PCR products 
10 were re-purified and cloned into the binary vector pPI, which was digested with the 
same restriction enzymes. PCR product and the open plasmid vector were ligated 
using T4 DNA ligase enzyme (Roche, Switzerland). 

EXAMPLE 10 

Transforming Agrobacterium tumefacience cells with binary vectors harboring 

cotton fiber promoters 
pPi Binary vector, including either CT2 or CT6 promoter, upstream to the 
GUS reporter gene were used to transform Agrobacterium cells. 

The binary vectors were introduced to Agrobacterium tumefaciens GV3Q1, or 
LB4404 competent cells (about 10 9 cells/mL) by electroporation. Electroporation was 
performed using a MicroPulser electroporator (Biorad), 0.2 cm cuvettes (Biorad) and 
EC-2 electroporation program (Biorad). The treated cells were cultured in LB liquid 
medium at 28 °C for 3 hr, then plated over LB agar supplemented with gentamycin (50 
mg/L; for Agrobacterium strains GV301) or streptomycin (300 mg/L; for 
Agrobacterium strain LB4404) and kanamycin (50 mg/L) at 28°C for 48 hrs. 
Agrobacterium colonies which developed on the selective media were analyzed by 
PCR using the primers set forth in SEQ ID NOs: 70-71, which were designed to span 
the inserted sequence in the pPI plasmid. The resulting PCR products were isolated 
and sequenced as described in Example 4 above, to verify that the correct sequences 
were properly introduced to the Agrobacterium cells. 

EXAMPLE 11 



20 



25 
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Cotton fiber specific promoters are expressed in tomato leaves and tomato fruits 
GUS staining was effected to illustrate specific expression in trichomes and 
tomato fruits. 

Experimental Procedures 
5 Transformation of Micro-Tom tomato plants with putative cotton promoter - 

As describe above. 

Transformation of Arabidopsis thaliana plants with putative cotton 
promoter - As describe above. 

GUS staining of Arabidopsis- Gus staining of arabidopsis plants was effected 
10 as previously described (Jefferson RA. et. al. 1987, Meissner et al. 2000). 

GUS staining of tomato leaves - Gus staining of tomato plants was effected as 
previously described (Jefferson RA. et. al. 1987, Meissner et. al. 2000). 

Tissue fixation was effected as follows. Tomato leaves were immersed in 90 
% ice cold acetone, then incubated on ice for 15 - 20 minutes following by removal of 
15 the acetone. Thereafter tissue was rinsed twice with the Working Solution [100 mM 
Sodium Phosphate (Sigma, USA) buffer pH=7, Ferricyanide (Sigma, USA) 5 mM, 
Ferrocyanide (Sigma, USA) 5 mM, EDTA (BioLab) pH=8 1 mM , Triton X-100 
(Sigma, USA) 1 %] for 15-20 minutes in dark. Rinsing solution was then removed 
and replaced with X-gluc staining solution [Working Solution + 5-bromo-4-chloro-3- 
20 indoiyl-P-D-glucuronic acid (X-GlcA, Duchefa) solubilized in N,N- 
Dimethylformamide (BioLab) 0.75mg/ml , Dithiothreitol (BioLab) lOOmM] and 
incubated for over night at 37 °C in the dark (tubes wrapped with aluminum foil). 
Distaining was effected by sinking the plant tissue in 70 % ethanol and heating at 50. 
°C for -120 minutes. Distaining step was repeated until the plant tissue became 
25 transparent excluding the blue stained regions. Distained plants were stored in 70 % 
ethanol (BioLab) at room temperature. 

GAS staining of Tomato Fruits - Gus staining of tomato fruits was effected as 
previously described (Jefferson RA. et al. 1987, Meissner et al. 2000). Briefly: thin 
tomato fruit slice were sunk in staining solution [100 mM Sodium Phosphate (Sigma, 
30 USA) buffer pH=8, Ferricyanide (Sigma, USA) 5 mM, Ferrocyanide (Sigma, USA) 5 
mM, EDTA (BioLab) pH=8 15mM , Methanol (BioLab) 20 % , 5-bromo-4-chloro-3- 
indolyl-P-D-glucuronic acid (X-GlcA, Duchefa) solubilized in N,N- 
Dimethylformamide (BioLab) 0.75mg/ml] in the dark (tubes wrapped with aluminum 
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foil) and incubated for over night at 37 °C. Distaining was effected by sinking the 
plant tissue in 70 % ethanol and heating to 50 °C for -20 minutes. Distaining step 
was repeated until the fruit slice became transparent except for the blue stained 
regions. Distained fruits were stored in 70 % ethanol (BioLab) at room temperature. 
5 Results 

GUS staining was performed on Seeds of Tl tomato plants. 

GUS was expressed under the regulation of CT2 and CT6, promoters in the 
genetically transformed tomato plants (Figures 7a-b). 

Results for tomato Tl generation are summarized in the Table 10, below. 

10 

Table 10 



Promoter 


No of 


Leaf 


Leaf 


Seed cover 


Seed cover 


Seed cover 




Independent 




trichome 


of Young 


of Mature 


of Ripen 




Tl plants 






fruit 


green 


fruit 


CT2 


four 


0 


2 


3 


5 


3 


CT6 


one 


0 


1 


1 


2.5 


1 



The numbers represent average grade, 0-not expressed, 5-high expression 



EXAMPLE 12 
Tomato seed hairs as a model system for cotton fibers 
15 The genetic modification of cotton is long and time consuming. Hence to find 

genes which are capable of improving cotton fiber yield and quality, a need exists for 
a model system for cotton fiber development in other plants. 

Trichoma cells and root hairs share common characteristics with cotton fiber 
cells, and are widely accepted as model systems for cotton fiber development 
20 [Reviewed in Wagner. G.J. et. al. 2004) and Wang et al. 2004]. 

However measuring changes in growth rate, length and thickness as well as 
other structural parameters is not an easy task because of the small size, remote 
accessibility and lack of uniformity in sizes of trichome cells. 

To overcome these limitations, tomato seed hairs were analyzed for their 
25 possible use as a model tissue for cotton fiber development. To this end, the GUS 
reporter gene was over-expressed under the regulation of cotton fiber specific 
promoter element derived from CT2, as describe above. 

Tomato transformation of the binary construct, plant regeneration and GUS 
staining was effected as described above. 
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Tomato seed hairs (Figure 8a) are maternal epidermal cells, covering the ovule 
surface of the seeds. In anatomical aspects, tomato seed hairs are much closer to 
cotton fibers than either trichome cells or root hairs. 

4 independent transgenic tomato fruits over-expressing GUS gene under 
cotton specific promoter CT_2 were produced. GUS staining of fruits at the mature- 
green stage (fruit is in full size just before the ripening process) was observed 
uniquely on the seed envelope, where seed hairs are being developed (Figures 7a and 

b). 

Five independent transgenic tomato fruits over-expressing 35S-expansin 
(AF043284) were produced, and the seed hair length was measured and compare to 
wt. The seed hair of transgenic plants was significantly longer than of wt (Figures 8a- 

b). 



Table 11 



Plant 


Number of Independent plant 


Seed hair length (mm) 


WT 


3 


0.300±0.019 


35S:expansin 


5 


0.357±0.017 (Figure 8b) 



It is appreciated that certain features of the invention, which are, for clarity, 
described in the context of separate embodiments, may also be provided in 
combination in a single embodiment. Conversely, various features of the invention, 
which are, for brevity, described in the context of a single embodiment, may also be 
provided separately or in any suitable subcombination. 

Although the invention has been described in conjunction with specific 
embodiments thereof, it is evident that many alternatives, modifications and variations 
will be apparent to those skilled in the art. Accordingly, it is intended to embrace all 
such alternatives, modifications and variations that fall within the spirit and broad 
scope of the appended claims. All publications, patents and patent applications and 
GenBank Accession numbers mentioned in this specification are herein incorporated 
in their entirety by reference into the specification, to the same extent as if each 
individual publication, patent or patent application or GenBank Accession number was 
specifically and individually indicated to be incorporated herein by reference. In 
addition, citation or identification of any reference in this application shall not be 
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construed as an admission that such reference is available as prior art to the present 
invention. 
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1. An isolated polynucleotide comprising a nucleic acid sequence 
encoding a polypeptide having an amino acid sequence at least 80 % homologous to 
SEQ ID NO: 26, 106, 107, 109, 110, 112, 114, 115, 118, 119, 122, 123, 124, 126, 95 
or 96, wherein said polypeptide is capable of regulating cotton fiber development. 

2. The isolated polynucleotide of claim 1, wherein said nucleic acid 
sequence is selected from the group consisting of SEQ ID NOs. 1, 2, 4, 5, 7, 9, 10, 16, 
17, 20, 21, 22, 24, 25, 27 and 13. 

3. The isolated polynucleotide of claim 1, wherein said polypeptide is as 
set forth in SEQ ID NO. 26, 106, 107, 109, 110, 112, 114, 115, 118, 119, 122, 123, 
124, 126, 95 or 96. 

4. The isolated polynucleotide of claim 1, wherein said amino acid 
sequence is as set forth in SEQ ID NO. 26, 106, 107, 109, 110, 112, 114, 115, 118, 
119, 122, 123, 124, 126, 95 or 96. 

5. The isolated polynucleotide of claim 1, wherein said cotton fiber 
development comprises fiber formation. 

6. The isolated polynucleotide of claim 1, wherein said cotton fiber 
development comprises fiber elongation. 

7. An isolated polynucleotide comprising a nucleic acid sequence at least 
80 % identical to SEQ ID NO: 85 or 91, wherein said nucleic acid sequence is capable 
of regulating expression of at least one polynucleotide sequence operably linked 
thereto in an ovule endothelial cell. 

8. The isolated polynucleotide of claim 7, wherein said ovule endothelial 
cell is of a plant fiber or a trichome. 
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9. An oligonucleotide capable of specifically hybridizing to the isolated 
polynucleotide of claim 1 or 7. 

10. A nucleic acid construct comprising the isolated polynucleotide of 
claim 1. 

11. A nucleic acid construct comprising the isolated polynucleotide of 
claim 7. 

12. The nucleic acid construct of claim 10, wherein the nucleic acid 
construct further comprising at least one cis-acting regulatory element operably linked 
to the isolated polynucleotide. 

13. The nucleic acid construct of claim 7, wherein said polynucleotide 
sequence is selected from the group consisting of SEQ ID NOs: 1, 2, 4, 5, 7, 9, 10, 16, 
17, 20, 21, 22, 24, 25, 27 and 13. 

14. The nucleic acid construct of claim 12, wherein said cis-acting 
regulatory element is as set forth in SEQ ID NO: 74, 75, 85 or 91 or functional 
equivalents thereof. 

15. A transgenic cell comprising the nucleic acid construct of claim 10 
and/or 11. 

16. A transgenic plant comprising the nucleic acid construct of claim 10 
and/or 11. 

17. A method of improving fiber quality and/or yield of a fiber producing 
plant, the method comprising regulating an expression level or activity of at least one 
polynucleotide encoding a polypeptide having an amino acid sequence at least 80 % 
homologous to SEQ ED NO: 26, 106, 107, 109, 110, 112, 114, 115, 118, 119, 122, 
123, 124, 126, 95 or 96 in the fiber producing plant, thereby improving the quality 
and/or yield of the fiber producing plant. 
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18. The method of claim 17, wherein the quality of the fiber producing 
plant comprises at least one parameter selected from the group consisting of fiber 
length, fiber strength, fiber weight per unit length, maturity ratio, uniformity and 
micronaire. 

19. The method of claim 17, wherein said regulating expression or activity 
of said at least one polynucleotide is up-regulating. 

20. The method of claim 19, wherein said up-regulating is effected by 
introducing into the cotton the nucleic acid construct of claim 10 and/or 11. 

21 . The method of claim 17, wherein said regulating expression or activity 
of said at least one polynucleotide is down-regulating. 

22. The method of claim 21, wherein said down-regulating is effected by 
gene silencing. 

23. The method of claim 22, wherein said gene silencing is effected by 
introducing into the cotton the oligonucleotide of claim 9. 

24. The method of claim 17, wherein said fiber producing plant is selected 
from the group consisting of cotton, silk cotton tree (Kapok, Ceiba pentandra), desert 
willow, creosote bush, winterfat, balsa, ramie, kenaf, hemp, roselle, jute, sisal abaca 
and flax. 

25. A method of increasing a biomass of a plant, the method comprising 
regulating an expression level or activity of at least one polynucleotide encoding a 
polypeptide having an amino acid sequence at least 80 % homologous to SEQ ID NO: 
26, 106, 107, 109, 110, 112, 114, 115, 118, 119, 122, 123, 124, 126, 95 or 96 in the 
plant, thereby increasing the biomass of the plant. 



26. The method of claim 25, wherein the plant is a monocot plant. 
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27. The method of claim 25, wherein the plant is a dicot plant. 



28. A method of identifying genes which are involved in cotton fiber 
development, the method comprising: 

(a) providing expressed nucleic acid sequences derived from cotton fibers; 

(b) providing expressed nucleic acid sequences derived from an ovule 
tissue; 

(c) computationally assembling said expressed nucleic acid sequences of 
(a) and (b) to generate clusters; and 

(d) identifying clusters of said clusters which comprise expressed nucleic 
acid sequences of (a) and (b), thereby identifying genes which are 
involved in cotton fiber development. 

29. The method of claim 28, further comprising identifying genes which 
are differentially expressed in said cotton fiber following (d). 



30. The method of claim 29, wherein said differentially expressed 
comprises: 

(a) specific expression; and/or 

(b) change in expression over fiber development. 



31. A method of producing an insect resistant plant, comprising regulating 
an expression level or activity of at least one polynucleotide encoding a polypeptide 
having an amino acid sequence at least 80 % homologous to SEQ ID NO: 26, 106, 
107, 109, 110, 112, 114, 115, 118, 119, 122, 123, 124, 126, 95 or 96 in a trichome of 
the plant, thereby producing the insect resistant plant. 



32. A method of producing cotton fibers, the method comprising: 
(a) generating a transgenic cotton plant expressing at least one polypeptide 
having an amino acid sequence at least 80 % homologous to SEQ ID 
NO: 26, 106, 107, 109, 110, 112, 114, 115, 118, 119, 122, 123, 124, 
126, 95 or 96; and 
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(b) harvesting the fibers of said transgenic cotton plant, thereby producing 
the cotton fibers. 
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SEQUENCE LISTING 

<110> Ronen, Gil 

Gold , Evgenia 
Yelin, Rodrigo 
Meissner, Rafael 
Karchi, Hagai 

<120> POLYNUCLEOTIDES AND POLYPEPTIDES INVOLVED IN PLANT FIBER 
DEVELOPMENT AND METHODS OF USING SAME 

<130> 29783 

<160> 126 

<170> Patentln version 3,3 

<210> 1 

<211> 1086 

<212> DNA 

<213> Gossypium hirsutum 

<400> 1 



cccgggatgg atggttattg 


tagcagaagg 


gtaatcatgt ttttggtgtt 


tgcatttgca 


60 


gcaataagca gagggtatgg 


acaagaatca 


accactcttg ttcctgcaat 


catcaccttt 


120 


ggtgactctg tggtagatgt 


gggcaataat 


gactatctcc ctaccatctt 


caaggctaac 


180 


tatcctcctt 


atggacggga 


ctttgccaac aaaaagccta ctgggaggtt 


ttgcaatgga 


240 


aaattagcca ctgacatcac 


tgctgaaact 


ctggggttta caacttatcc 


accagcttac 


300 


cttagcccag 


aagcatcagg 


gaagaacctt 


ctgcttggag ccaattttgc 


ttcagctggc 


360 


tctggctatg 


atgacaaagc 


tgccatggtg 


aatcatgcca tcacattgac 


ccagcaatta 


420 


gagtatttca 


aggaatacca 


ggcaaagcta 


gcaaaggtag caggcagcac 


caaatcagca 


480 


tccattacca 


aggatgcact 


gtatgtattg 


agtgcaggaa gcggtgactt 


cctccagaac 


540 


tactatgtca 


accctctact 


taaccatgcc 


tatactccag accagtacgg 


ctcattcctt 


600 


attgatacct 


tcacaaactt 


cgtcaagaac 


ctctatgggt tgggagctag 


gaaaattggg 


660 


gttacctcac 


ttccaccgtt 


aggttgcgtt 


ccattagcaa gaacattgtt 


cggttaccac 


720 


gagaaaggat 


gcatctccag 


gttcaatacc 


gatgctcaac aattcaataa 


aaagctcaac 


780 


gccgcagcag 


ccaatctcca 


gaagcagcat 


cctggtctta agattgtggt 


tttcgacata 


840 


ttcaaggcac 


tttacgacat 


tgttaaatct 


ccctctaact atggttttgt 


tgaagcaaca 


900 


aaagggtgtt gtggaactgg 


aacagtagag acaaccgcat ttttgtgcaa 


tccaaaggca 


960 


ccaggaactt gttccaatgc 


cagccaatat gtattttggg acagtgttca 


tccatctcag 


1020 


gctgctaatc aagtccttgc 


agatgcattg attgttcagg gcattgccct 


catttgattc 


1080 


gagctc 










1086 



<210> 2 
<211> 1975 
<212> DNA 

<213> Gossypium hirsutum 
<400> 2 

tctagacaag tacagaagct caattcccaa gatggaggcc agcagcagca cctcccatga 60 
cccagcattg ttccatgctc ccttgctata ccaccctcgg agaaggagca gcagaccctt 120 
aaagggtttc gcagtgataa ttgggtccgt cgttttccta ctctcactgg tcacattaat 180 
cgttaaccaa agcccggagc cattagcaag taaccccagt agtgtaacgg aggcagggtc 240 
gtattcaatg gcggcgcagc caagagggat agctgaaggt gtttcagcca agtcaaaccc 300 

1 
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atcacttttt 


gacaaagttg 


ggtttaattg gacaaacgct atgttttact 


ggcaaagaac 


360 


tgcctaccac 


tttcagcctc 


aaaagaattg gatgaatgat cctgacggtc 


cgttatatca 


420 


caagggatgg 


taccatcttt 


tctatcaata caaccctgat tcagccatat 


ggggaaacat 


480 


cacttggggc 


cacgctgtat 


caacggacct cattcactgg ttctatctcc 


cactcgccat 


540 


ggtccctgat 


caatggtacg 


atatcaacgg ttgttggacg gggtcggcca 


ctctcctgcc 


600 


agatggccga 


atcgtaatgc 


tttacaccgg cagcaccaat gactccgtgc 


aagtccaaaa 


660 


ccttgcatat 


cccgccaacc 


tatctgatcc cctcctcctt cagtggttaa 


aatacccggg 


720 


taacccggtt 


gttgttcccc 


caaccgggat cgaagacgaa gagttccgag 


acccgacaac 


780 


agcttggctt 


ggacccgatg 


gttcctggcg gattgttgtt ggtacaaggt 


ttaataccac 


840 


cataggaaca 


gcccttgttt 


ttcaaacgac aaacttttcg gactatgaat 


tattggatgg 


900 


ggtcttacat 


gctgttccgg 


gtacgggtat gtgggaatgt gtagattttt 


accccgttgc 


960 


aataaacggg 


tcggtcggac 


tggacacgac ggcacttggg cctggaatta 


agcatgtcct 


1020 


gaaggctagt 


ttggatgata 


cgaaagttga tcattatgca atagggacct 


acgacatgat 


1080 


aacggataaa 


tggacacctg 


ataacccgga agaagatgta ggcatcgggt 


tgaaagtgga 


1140 


ttatgggaga 


tactatgcct 


ccaagacatt ttttgatcag agtaaacaaa 


ggaggattct 


1200 


ttatggttgg 


gttaatgaaa 


ctgattctga agctgatgac ctcgaaaaag 


gatgggcttc 


1260 


cattcagaca 


attcccagga 


gtgtgttgta tgacaacaag accggaaccc 


atttactaca 


1320 


gtggcctgtg 


gaagaagtgg 


agagcttgag actgaatgct acagtgttta 


aggatgttgt 


1380 


agttgaagca 


ggatcagttg 


tgcccctcga cataggcacc gctactcagt 


tggatatatt 


1440 


agcagagttt 


gaaatagaga 


cgttggtatt gaacagcacg gaggatgaag 


tcagtgattg 


1500 


cggtgatggg 


gcggttgata 


ggagcactta cgggccattt ggggtcctgg 


ttattgctga 


1560 


tgattcactt 


tctgagctca 


ctcctatata tttccgtcca cttaatacat 


ccgatgggag 


1620 


tcttgaaact 


tacttttgcg 


ctgacgaaac aaggtcttct aaagctcccg 


atgtcacaaa 


1680 


acgagtgtat 




+" tecacrfccrcfc tcraccrafccraa aaetaeaar*;! 


t" era cr cr or t" a 1 1 


1740 


ggtggatcat 


tcagtagtgg 


aaagttttgg ag.gaggaggg aggacg'gtga 


taacatcaag 


1800 


agtgtatcca 


acggaagcca 


tatatggagc agcacggctg ttcttgttca 


acaatgeaag 


1860 


tggagtgaat 


gtgaaggcca 


cactcaaaat atgggagatg aattctgcct 


ttattegtec 


1920' 


tttcccattt 


gaagaaacat 


tatttcagga aatggttgct tccacatgat 


tatca 


1975 


<210> 3 
<211> 560 
<212> DNA 

<213> Gossypium hirsutum 






<400> 3 

cccgggtgat ggaactgagc attcagaaaa tagaagcctt gattaggcta 


agtacgatag 


60 


tgatgttggt tttaacagct tgtttaattg ggttggattc tcaaacaaag 


gtcatcttct 


120 


acgttcaaaa 


gaaagcttct 


ttcaaggatt tgcgtgctct tgttggattg 


ctgtatatca 


180 


cttcattggc tgctgcttat aatctacttc aactatgctg ttcttcattc 


tcagcttctt 


240 


acaaaggaac ctcgctgcaa tcttacgcat atctggcttg gcttcgttat 


attttggatc 


300 


aggcagtagt gtacgcagtg tttgcgggaa acctagcggc tttggagcat 


tcatttttgg 


360 


tattaaccgg agaagagaac ttccaatggc tcaagtggtg caataaatat 


actcgattct 


420 
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gcacccaaat cggaggatcc ttgctctgcg gcttcgttgc aagcttacta atgttttcca 480 
tcgcttccat ctccgcattc aacttgttca ggctgtattc ccccaccaag ttcatgcact 540 
tgaaactcta atcagagctc 560 

<210> 4 

<211> 1143 

<212> DNA 

<213> Gossypium hirsutura 
<220> 

<221> misc_feature 

<222> (24).. (24) 

<223> n is a, c, g, or t 

<400> 4 



tgtgtataaa 


tatgtagaca 


cagnagatcc 


aatatatttt cactttcaca gcaaccttct 


60 


tctccactcc 


tccatctata 


tatatagtta 


gaggcagaag ggaggaagtg acattggcaa 


120 


tatggctttg 


ctgcattctt 


acttcttaag 


cgctggtgag tatgtttcag ggtccacggt 


180 


cttggcagtg 


gcaatgtcca 


ttgcttccta 


ttttctgatc cctatggtgt ttggtggtcg 


240 


ccataaaaac 


tggaagaatg 


caccaccagg 


ccctgttggt tggcctatcc ttggcagcct 


300 


tccacacctc 


tccaatcgtc 


tccatgaaga tttctttgac atggccaagg tctacggtcc 


JDU 


ccttttcagt 


ctaaacttgg 


gaataaagcc ggccatagtg gtgtcatcac cggaaatggc 


420 


agcccaagtt 


ttgaaggaaa 


aggaagggat 


gttctccagt cggaccataa ccgagaccat 


480 


tcgagtcatc 


tcttatgatg 


cccattccat 


cattttctcg ccctatggtc ccaggtggaa 


540 


ggttcttcga 


aggatcttga 


tcaccgaact 


actttctcct aaggcctttg aacaatttga 


600 


gccacttcgt 


acctcacagg 


ttcatggttt 


gctcaagtat ttgtacttgg tctcaaagtc 


660 


caacactcaa 


gtgaacatag 


cagaatatgc ttttacagca ctggccaacc tagtgagtaa 


720 


•tttcgtctgc- 


-tccaaggacc- 


ttttegacaa 


-ctcaatgcct gaaggaagaa aaatgaaaga • 


780 


gaggttctgg 


gagttgataa 


aggtgattgg. gaccccgaat ttttctgatc tcattccatt 


840 


tgttaaacca 


tttgatccac 


aaggccttga 


agagaaaaat caacaagatc tycggacagt 


900 


tggatgcttt 


ctatgagaag 


tatatcgagg 


agaagtttgc tgacaaggga aaagctcaac 


960 


ttgatgggac 


gataccctac 


caacggaaaa 


atggatatgt tagatgttct gttgagttat 


1020 


gagaagaatt 


gataaacaaa 


atgggtttgg 


acccgtttac cacaatccta tcgtccaaag 


1080 


gaatgctttt 


ctgaaaatgg 


ttaattggca 


gcgaactgaa aaacacccct caacgcacct 


1140 


ggg 








1143 



<210> 5 
<211> 1464 
<212> DNA 

<213> Gossypium hirsutura 
<400>. 5 

cccgggttgt tgtcatggct tctctgcctt tcatcttctt cttatctttc tttataatct 60 
ccacaacatt gacgtcagcc ggcgccgccg ccgccaccat caaactctcc ctctctccct 120 
tccctcaccc ttcttcctcc catccttacc aaattctcaa caacttagtc acttcttctg 180 
ttgcaagagc ccaccacctc aaacatccca aagccaaggc cgataatact acctcttctc 240 
ttctcagggc tcccctattt tctcacagtt atgggggcta cactatctcc ctcaaatttg 300 



3 



WO 2005/121364 



PCT/IL2005/000627 



gaactccgcc tcaaaccctt cctttcgtca tggacaccgg gagcagcctc tcctggttcc 360 

cttgcacctc tcgttacctt tgttcccaat gcgcattccc caatgttgac cctgcaaaaa 420 

tccccacttt tgcccctaaa ctttcatctt ccagtaagct cgtaggttgt agaaacccca 480 

agtgtagttg gctttttggc cccgacgttg agtctcgttg ccaagactgt gaacccactt 540 

ccgaaaactg cactcaaacc tgccctcctt acataattca atacggttta ggttccactg 600 

ctgggcttct attagtagaa aaccttgctt tcccccagaa aaccttccaa gatttccttg 660 

tcggatgctc catcctctcc aaccgacagc ccgctggaat agccgggttc ggtcggagcg 720 

ctgagtctat accctcccaa ttaggcctca agaaattctc ttactgtctc gtttctcgcc 780 

ggttcgatga cactggcgtc agcagcaaca tgttgttgga aaccgggtcg ggttccggtg 840 

atgccaagac cccaggcctt agctacacac cgttttacag gaaccaagtg gcttcaaacc 900 

cagttttcaa agagttctac tacgtaactc tacgtaaaat tctggtgggc gataagcacg 960 

tcaaagttcc gtacagttat ttggtcccag gatcagacgg taacggtggc accatagtgg 1020 

actcgggatc aacattcact tttatggaga gaccagtgtt cgaggtagtc tcgaaagagt 1080 

tcgagaaaca aatgggaaat tatagaagag tgcgtgaaat agaaaacaga tcgggtttag 1140 

ccccatgctt caacacttcg ggctatactt caatagaaat ccccgaattg agtttccagt 1200 

tcaaaggagg agccaaaatg gcattgcctt tggttaacta tttctcattt gacggtgatg 1260 

ataaggttgt gtgtttgatg atcgtttcaa acaatgtggt cggccaaggc tcacacagcg 1320 

gtcctgcaat aatactaggg agctttcagc agcagaatta ttacatcgaa tttgatatcg 1380 

caaacaatag gtttggatgg gctgaacgaa gctgtgcgtg agctgcactt tgttattttg 1440 

tgttttggcc atgaatatga gctc 1464 

<210> 6 
<211> 1402 
<212> DNA 

<213> Gossypium hirsutum 
<400> 6 



cccgggaaag 


gaaatggcag 


gcgtcgaagc agggaaggag 


gaggaggcga ccgctgtgag 


60 


aatcaccgga 


aaaagccacg 


tcaagccggg taagttgata 


ggaagaaaag agtgtcagtt . 


120 


ggtcacattt 


gatctcccct 


acctggcttt ctattataac 


cagaagctgc tgttttacaa 


180 


gaacgacggt 


ggtggtgagt 


tcgaggacaa ggttgaaaag ctcaagggag ggctgagggt 


240 


ggtgttagag 


gagttttatc 


agctaggagg taagctcggg aaagatgacg atggggttct 


300 


tagagttgat 


tatgacgatg 


atatggatgg tgttgaagtg gtggaagccg tggcagaggg 


360 


gattaccgtc 


gatgaattga 


ccggtgatga tggtacgagc 


tcatttaagg aattgatacc 


420 


ttttaatggc 


gtcttgaact 


tggagggtct tcacaggcct 


cttttgtcca tacagttgac 


480 


gaagttgaaa 


gatggtgttg 


caatggggtg tgctttcaac 


catgccatcc tcgacggaac 


540 


ctccacttgg 


cattttatga 


gctcttgggc tcaaatctgt 


aacggtactt cgagctccgt 


600 


cgttgtgccg 


ccgtttcttg 


atcggaccac agctcgaaac 


acccgcgtga agctcgacct 


660 


cagtccggtt 


gtttcctgca 


acggcgacga cgccaccaaa 


caaggccagc cggcgccgca 


720 


gatgagggag 


aaactcttcc 


gtttttccga agccgccgtc 


gataagatca aatcgagagt 


780 


taattcaacc 


ccaccaccgt 


ccgatggctc taaaccgttc 


tcgactttcc aatctctagc 


840 


tgtccacatt 


tggcgacacg 


tatcccaagc acgtaacctt 


aaacccgaag actacacggt 


900 
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ttttactgtc 


ttcgccgatt 


gtcgtaaaag 


ggttgatcca 


ccgatgcccg 


acagttactt 


960 


cggaaacttg attcaagcca 


tcttcaccgc 


cacagcggcc 


gggttgttat 


tggaaaaccc 


1020 


accgtcattc ggagcttcag 


tgatacaaaa 


agctatagaa 


tcccacgacg 


ctaaagccat 


1080 


cgatgaacgt aacaaggcat 


gggaagcagc 


gccgaagatt 


ttccagttca 


aagacgccgg 


1140 


tgtcaactgc gtagcggtcg 


gaagctcccc 


gaggtttaaa 


gtttacgaag 


tggatttcgg 


1200 


gtggggaaag 


ccggtagggg 


tgaggagtgg 


atccaacaac 


aggttcgatg 


gaatggtgta 


1260 


tttgtatcaa 


gggaagagcg 


gtggccggag 


cattgacgtt 


gaaatcacca 


tggaagctca 


1320 


agctatggag 


aaattggaga 


aggataaaga 


gtttttaatg 


gaagtatagt 


attttgcatg 


1380 


gaagtagggt 


actgtggatc 


tc 








1402 


<210> 7 
<211> 1480 
<212> DNA 

<213> Gossypium hirsutum 










<400> 7 
cccgggtacc 


attactctac 


tacagctgct 


atcattacta 


catacatgtc 


cacccaaagt 


60 


cgagcagttg 


gcggcaccga 


gcacaactgg 


tgccgagcgg 


tggtcggggg 


aaccgggata 


120 


gccgtcttgg 


ctatcatttc 


ttccaaaaac 


cccgacgttt 


cacatcttaa 


aaatgccctc 


180 


cacaagctcc 


aaatctccca 


tcccattctt 


aggtctcgcc 


tccattacag 


tcctactgcc 


240 


aacagttact 


ccttcgttac 


ctccccctcc 


cctttcattc 


aaatcaagta 


ctttaaccat 


300 


tctacaactt 


gtcaaatcct 


tgaaaacaac 


caaaacatct 


caccccttca 


tttgattctc 


360 


gaacacgagc 


ttaaccaaaa 


cgcttgggtt 


agttcttcat 


gtaccaccaa 


acacgacgtg 


420 


tttttcgcca 


gtgtttatgc 


cttgcctggt 


gcaacaaggt 


gggtgttggt 


gctccgccta 


480 


catgcggctg 


cttgtgaccg 


gaccacggcg 


gtgtcgttgc 


tgagagagtt 


gttgacgtta 


540 


atggctattg 


aggaggagga 


aacagggttt 


cagcaaggtc 


aaaaagaaat 


tacgatgaac 


600 


aaaggagaga 


tcagtttggc 


catggaagat 


attcttccaa 


aaggcattgt 


taagaaaaca 


660 


ctttgggcac 


gaggagtgga 


catgctaagc 


tactctgtta 


attctttaag 


gttcacgaac 


720 


ttgaggttca 


aagatgccaa 


atctcctaga tctactcaag 


tagtgaggtt 


gcttatcaac 


780 


cctgatgaca 


ctcagaagat 


cttgactggt tgcaaggcaa 


gagggattaa 


gttatgtgga 


840 


gcattaggag 


ctgccgggct 


gatttctgca cacagttcta 


aaagccgttc 


agatcatcaa 


900. 


aagaagaaat 


atggcgttgt 


aacactcaca gattgccgct 


caattcttga 


acctccgctc 


960 


tccaatcacc 


atttcggttt 


ttaccactca gctattctga 


acacgcacgc 


catcaaagga- 


1020 


ggagagaagc 


tttgggagct 


agcagagaaa gtgtacaccg 


tatttacaca 


ctacaagagc 


1080 


tgcaacaagc 


acttgtcaga 


catggcagac ctgaatttct 


taatgtgcag 


ggccatggag 


1140 


aaccctggct 


tgactccatc 


tgcctcattg aggacatgtt 


tgatatcggt 


cttcgaggat 


1200 


acggtgatag 


atgagtctag 


taaccagcaa aatcaagtcg 


gcgtagagga 


cfcafcafccrcrcra 


1260 


tgcgcttccg 


ctcatggcat 


cgcgccgtcc atcgcgatat 


tcgacaccat 


acgagatggg 


1320 


cgactggatt 


gcatttgcgt 


ttatccttcg 


ccgttgcatt 


caagggaaca 


aatgcaggag 


1380 


ctggttgata 


atatgaagtg 


catacttgtg gatgcaggga 


agaatgttgc 


tgatgaaact 


1440 


gagagttaag 


gagccagtct 


ggtctttgtc 


tgttgagctc 






1480 



<210> 8 
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<211> 720 

<212> DNA 

<213> Gossypium hirsutum 

<400> 8 



acccccgggc 


aagtgatcaa 


agagaatggg 


aagaggaaaa atagagataa agaggatcga 


60 


aaacacaaca 


aatcgtcagg 


ttaccttttg 


caaaegcagg aatggcctgc tgaagaaagc 


120 


ttacgaactg 


tcagtcctct 


gtgatgctga 


agttgctctc attgtcttct ccagtcgagg 


180 


ccgtctgtat 


gagtactcca 


acaacaacat 


aagatcaaca atagacaggt acaagaaggc 


240 


ttgctcagat 


acttctaaca 


caaacactgt 


tactgaaatc aatgctcagt attatcaaca 


300 


agaatcagcc 


aagttgagac 


agcagattca 


ddL.ytl.dCdy dd L. L-C LadL.a yyLaLUtdal 


360 


gggagattcc 


ttgagttcct 


taactgtgaa 


agagttaaag caggtagaaa acaggcttga 


420 


aagaggaatt 


actaggatca 


ggtccaagaa 


gcacgaaatg ctactagctg aaatagagtt 


480 


tttgcagaaa 


agggaaatcg 


aattggaaaa 


tgaaagtgtt tgtctccgaa ecaagattge 


540 


agaaattgag 


aggcttcagc 


aggcaaacat 


ggtgactgga cctgagctta atgetattea 


600 


agctttagct 


tctcgcaatt 


tctttagccc 


caatgtcatt gagcatccat ctgcttactc 


660 


ccatctctct 


gacaagaaga 


ttctccatct 


tgggtagagg agttggagaa agagctcatg 


720 


<210> 9 
<211> 973 
<212> DNA 

<213> Gossypium hirsutum 






<400> 9 
cccgggtccc 


tattgcatgc 


ctttcaattt 


gatttcatgg atgtaacaag cacaccaaat 


60 


agaaaagaaa 


tggatcggat 


caaaggtcca 


tggagccccg aagaagatga cttgctccag 


120 


cagctggtac 


agaaacatgg 


ccccagaaac 


tggtctttga tcagcaaatc aatccccggc 


180 


cgatccggta 


aatcctgtcg 


gctccgatgg 


tgcaatcaac tgtcaccgca agttgagcac 


240 


cgtgccttca 


ccccggaaga 


agacgagacc 


atcatccgag cacatgccag gtteggtaac 


- 300 


aagtgggcca 


ccatagcccg 


actcctcaac 


ggccgtaccg acaacgccat taaaaaccac 


360 


tggaactcca 


cgctaaaacg 


taagtgcttg 


ccggttgggg aagagtgtaa tttcgttgct 


420 


aatggagggt 


atgatggtaa 


tctgggagga 


gaggaaegge aaccgttgaa aagatcggtg 


480 


agtgctggtc 


tatacatgag 


tccagggagc 


ecategggat cggatgtgag cgattctagt 


540 


gttcccgtct 


tatcatcttc 


ttacgtgtac 


aagccgatcc caaggacegg cggtgttaac 


600 


gttgatgtaa 


atgttacgcc 


agctggagtg 


gaageggcat catcttccaa cgatccaccg 


660 


acctcactga 


gtctgtcttt 


acegggggtg 


gagtcatgtg aggtggtgtc aacccagcca 


720 


ataacggagt 


caactcagaa 


tcggagtgaa 


gaaaggggag gtggggtgat gggtttcagt 


780 


gcggagttta 


tggcggtgat 


gcaagagatg 


ataagggttg aggtgaggaa ttacatgacg 


840 


cagatgcagc 


aacagcagca 


gcagcaaaac 


ggcgcagttc egggaggage gggaatgggg 


900 


atgtgtttgg 


atggggggtt 


caggaatctt 


atggctgtga acccagtcgg gatgagtaag 


.960 


atcgagtgag 


etc 






973 



<210> 10 

<211> 1861 

<212> DNA 

<213> Gossypium hirsutum 

<400> 10 
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240 
300 



600 
660 



900 
960 



cccggggata gagagatggg aggtccaccg tacgattgct tggcgaatcc cctaggagcc 60 

gtccgattaa cattcgagaa ggcaatatgg tcagaatcgg agactcctcc gatccatccc 120 

tccgccttta acggcaaaga ttggggtgcc cttgaactct tccgccactt cctcttccaa 180 
ggatcagggc tttcccaggt tcccatcctt aatcccaaaa cattaagatg ggttcaaccc 
aacagtcttg tacgttaccg tggtatgatc caagacatgt tgggaaatga attctatgcc 

ggcgcttaca aggatggaaa tttatggcgg accaacaaat tcatggatgt ttctcaatac 360 

ccaatgggtt cctctcctga tatgtgtatt tgggaacgcc gcttgctcta ctgtgttcct 420 

gtcccaggac agaattcatg gactgaacct tctagtgaaa tggaacctaa ttggtcatct 480 

caaaccaggg agaagcggcg taggatggat gacgaagata atgatcccat ggatttggtt 540 
cctgatgatg agattaaaag ctctccaatt accaagaaga tgagagaaga tggacttcct 
tccccttcac aatccaggga tactaaaact acaagctctt cttctatcac aagtacattt 

caatctgttg acgaagataa ccttccttgc ctagtcaaga tatatgattc tccagaatca 720 

gaattgaagc tgaatgatgt ttttgaattt attggggtcc tcacttttga ttcagagctt 780 

gcagttgaga aagatgacaa tgatgagtta tcaaatagtt tctatgatga tgccctggtc 840 
catttgcccc ctaataaggt ccctcgcttg cactgtctta tacataggaa gcttgcagtg 
caggactttc tgccaggttc cccaataata gagccaaagc cacatttggt gaaagagaca 

agggaagctc tgttcaggca tcttacggct gttcttggaa atgatgaggt agctgctcat 1020 

ttcgtgttgt tgcatcttct gtccaaggtt catgctcgag tagatgatgt tgcagtgggg 1080 

aagctgtcac tcaatctaac aggtttaaac aaagaaagtg tatctgtgtt tggtactcga 1140 

cttagtgata cattcaaaaa cctcctacca ttcacgaatt gcatgcctct cacactggaa 1200 

tatctgaaca ttgcctcgct tgccccgcaa aaggattatc aagccaacag attggttcct 1260 

ggcgttcttc agctacccga gggctcacac ttgatggtag acgagacccg actagaatca 1320 

ggaagcctca atrtctactgg- aattgagaat acaaagttgc tgaaaaatct catcgagttt 1380 

caaaaagtgg agtatgactt tcaatactat aaagtggaaa tggcaacgga tgtccagtta 1440 

cttatcttct cggaggggaa atctaatatt gttcctgctg atgttattgt accttttcaa 1500 

ccttcttgtc ttgaatccac tgaaatgcca gttgctgagg cactagaagc ttggagatgg 1560 

tacttggcta ctgttagatc attaccacat tccattggat cagaaataca gaaggtggta 1620 

gaagatgatt tggttgcagc aagacaaatg gatcggagct tgggaagtcg agattttagc 1680 

agatggttga cgatggctcg gctcatatcg tcaagtttcg gagaaaccag tttgtcaaag 1740 

gaacattggg aaatggccaa agaaatggag aggctaagga gggagagact gaaatagaat 1800 

ccaaaagtcc acaagatttt gaagctttgg tatttggtaa atgattgttg ccccagagct 1860 

c 1861 

<21Q> 11 
<211> 917 
<212> DNA 

<213> Gossypium hirsutum 
<400> 11 

aaacaacatt tgtgtttcaa aaaaaactaa tccatatact gcaaaatcct tgtgcatctt 60 

cttcaaagca aacagcaaca actccaacaa atgtccatga aaaaagaagg tgaaattcta 120 

tacaaaaagg gattatgggc aatggaggaa gacaagttac tcattgatta tgtcaatgtc 180 
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catggaaaag 


gacaatggaa 


caaaatagcc 


aacagaacag 


agttgtcggc 


taaggtggat 


gaattacctg 


agtcctaacg 


gaagaagaag 


aagacctcgt 


cattagactt 


cataagctct 


ttgcgaaacg 


agttccaggt 


cgaactgaca 


atcaagtcaa 


tgaggaagaa 


actagggatc 


attgatcaaa 


acaagacaag 


caaagcaagt 


caaagtgtgt 


catgttgatg 


aggcagccac 


gaacaaccac 


tgaaaccacg 


ggtataacag 


tggatcagag 


atcatcgggt 


cttaaacaat 


actactcaag 


aatcaatgac 


ctttctggat 


tcctgaccat 


gattatgagc 


taagtacact 


atgaatgktc 


tyyttttcay 


cttarctaga 


gactatgtta 


tagatataag 


tattcatcta 


acatggcaat 


gttaaatttt 


ttgagcagtt 


catgtgtttg 


aagattaaga 


tatatctgaa 


aaccattatc 


gaattta 







<210> 12 
<211> 2024 
<212> DNA 

<213> Gossypium hirsutura 
<400> 12 

cccgggcatc tgatctaatt gttggtggac acacacacac 
atgtagcttt tagctttgaa atgtctactg gagttcaaca 
acctgaagaa acaacttgct cttgctgtga ggaacattca 
ggtccatatc aactagacaa ccaggggtgt tagaatgggg 
atataaagac aaggaaaaca gttcaatctg tagaactcaa 
agagaagtga gcaactgaga cagctttatg agtctctttc 
aagctaaacg accttcagca gcattatctc ctgaagatct 
acttggtttg tatgtcattt gtattcaaca ttggccaagg 
ctactggtca acctgtttgg ctttgtaatg ctcattgtgc 
gttcactact agctaagagt gcatcgattc agactgcagt 
gtgtggttga gctcggtgtg actgatttgg tatttgaaga 
ttaaaacttt gctcttggat gatccacagc cgattgtttc 
atgggatgaa caacgatctt gcttgtccag ctcttgatcc 
tgagtccaat attaggctgt gaacaactag aaacggtttc 
gcttggagcc taagcaatca agagaagatt cattattgat 
cttctcaagt acaaagttgg caattcatgg atgaagagtt 
ccttgaattc aagtgactgc atatctcaaa ccattgcgga 
tttaccgggg agaaaatgat aatggtttgc aagatgttga 
taacatcttt tgatcgccaa aacgatgatc ggcacttcca 
tcaagagctc acacccgttg attttaggac cacagtttcg 
gctttatcag atggcagaaa aatggcttgg tgaagcctca 
ctcaaaagtt actgaagaag atattgttct tggttcctca 



PCT/IL2005/000627 



gtttgaagag 


aagkgggaaa 


240 


ttaaaaaggg 


tgatttttct 


300 


tggaaacagg 


tggtctttga 


360 


gaattactgg 


aatagtcatt 


420 


gatcgatttt 


tgtcaaagtt 


480 


ggatccaagt 


cctggacatg 


540 


taaccagcas 


gaagtcattg 


600 


cactgagaric 


tatatcaaca 


660 


tgccatgatt 


gaccacttcc 


720 


ttarattcgg 


gttttatttt 


780 


tcaaaagatt 


tttcatgtat 


840 


acaaatgcca 


caatcaaaat 


900 






917 



acacacacac 


acacacatac 


60 


tcaagagaga 


gtaccaatga 


120 


atggagttat 


gcaattttct 


180 


agaaggttat 


tacaatggag 


240 


cactgaccaa 


ttgagtttac 


300 


agctggtgaa 


agcagtcctc 


360 


tactgatact 


gaatggtatt 


420 


attacctgga 


agaacattgt 


480 


tgacagtaaa 


gtgtttggtc 


540 


atgctttccg 


ttttcaggag 


600 


tttgagcctc 


attcagcgcg 


660 


taagagatcg 


attcaagtcg 


720 


tttgatcctt 


gccaccaaat 


780 


tcctgatgat 


agtccggacg 


840 


tgaagggata 


aatggtggag 


900 


ttgcaattgt 


gttcaccatt 


960 


tcatcgaaag 


gtcgttcctc 


1020 


agagtgcaat 


cagactaaac 


1080 


tgaagttctc 


tcggccttat 


1140 


aaactctaac 


aaggaatcga 


1200 


aaaagaaaga 


gatgaaaccc 


1260 


tatgcatgat 


agaggattga 


1320 
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ttgaatctcc 


tgaaactaat 


gctgttcgag 


atgcagcttg 


tttgcggaaa 


ccatgtgtta 


tcggagagga 


agcggaggga 


tgatgttgaa 


atcacttgtc 


cctgcaaata 


acaaggctga 


tcacgataga 


atacttacaa 


accctcgaaa 


gaagggttgc 


agtcagaagc 


aagaacgaaa 


atcgagcgaa 


catcagataa 


acaacggaaa 


gaaatcgtcc 


ctaagtaaaa 


ggaaagccta 


atcaagagat 


cggctatgtt 


gcatctaaag 


acggttcaac 


tgaacaacaa 


ggagcttcta 


atcgagttca 


agtgtccatg 


aggtaatgga 


tgcattaagc 


attctcaatt 


tggattgcca 


ctgaggggat 


tctctccctg 


accataaaat 


ccaagtacaa 


caggaccaat 


cgagcaagca 


ttgcaaagaa 


ttgctagcaa 


agattttacc 


agtttctttt 


gtaagatccc 


atgtcataag 


<210> 13 
<211> 810 
<212> DNA 

<213> Gossypium hirsutum 




<400> 13 
cccggggcta 


gctcttactc 


aaatggcaac 


caaaacgatg 


tttcttcttt 


ttgttcagtg 


tctgcaactc 


cattttcctt 


tggtggttgg 


caaactgccc 


atgccacctt 


ctacggtggt 


ggggggagct 


tgtggttatg 


gaaacctgta 


cagtcaaggg 


tttgagcact 


gcacttttca 


acaatggctt 


gagctgcggt 


caacaatgat 


cctcaatggt 


gcattagtcg 


aaccataacc 


tccacctaac 


tatgctttat 


ctagtgacaa 


tggcgggtgg 


ctttgatttg 


gccgaaccgg 


cattcttgcg 


gatagcagaa 


tgttatgttc 


agaagggtgt 


catgtgtgaa 


gaaaggaggc 


acattcgtac 


ttcaacatgg 


tgttgataac 


gaacgtggga 


agtgtccatc 


aagggttcca 


gaacaggatg 


gctacctatg 


ctggcagagc 


aatgcttacc 


ttaacggaca 


aagcctctct 


tggcaggact 


atcacagcct 


acaatgtagt 


gcctgctggt 


tgaaggaggc 


cagttttaag 


acaagatatc 





<210> 14 

<211> 21 

<212> .DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 14 

acctcccatg acccagcatt g 

<210> 15 

<211> 22 

<212> DNA 

<213> Artificial sequence 



PCT/IL2005/000627 



gagacccgaa 


gctgatgaaa 


1380 


aaaaataaac 


gaacgactta 


1440 


caaggtttct 


atactagatg 


1500 


ggaattggaa 


tcttgcagaa 


1560 


ctacggcaat 


aataaaacca 


1620 


tgatgttgtt 


gatgaagctg 


1680 


agataaagtt 


actctcagta 


1740 


gcgagaagga 


attttgcttg 


1800 


ctcagttcag 


tcatctacca 


1860 


aggatcaagt 


gttgcaaaag 


1920 


gtgttgaagc 


tatttgttct 


1980 


gtgc 




2024 



atgttgcaaa 


tatttccact 


60 


ggtgctaatg 


gagatgacaa 


120 


gctgatgcta 


ccggcacaat 


180 


tatggaacga 


gcacagcagc 


240 


gcctgctacg 


agctccggtg 


300 


gtgacagcca 


ccaacttttg 


360 


tgcaatcccc 


cacgagaaca 


420 


tatcgagctg 


gaatcgtccc 


480 


atcaggtaca 


ccatgaatgg 


540 


ggggcagggg 


atataacgtc 


600 


tccagaaatt 


ggggccaaaa 


660 


tttaaagtga 


ctgccagcga 


720 


tggcaattcg 


gacaaacttt 


780 






810 



21 
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<220> 

<223> Single strand DNA oligonucleotide 



<400> 15 

ccagcaacca ttgatatcgt ac 



<210> 16 
<211> 398 
<212> DNA 

<213> Gossypium hirsutum 
<400> 16 

cccgggtaca aacatggcta gttccggtgt 
gtgcatgacg atgatgagtg cacccaaggc 
ggtgaaccac ttgatcccgt gcttgtccta 
atgctgcagt ggggtaaaag cactctacgg 
cgtgtgcaag tgcatcaaat cggcggtgaa 
tctcgcagcc ggcctacctg ctaaatgtgg 
cactgactgc aacaaggtgc agtgaggttg 



<210> 17 

<211> 1223 

<212> DNA 

<213> Gossypium hirsutum 



ccttaagttg gtttccatga ttctcatggt 
agccaaagcc gccatcacgt gcagcgacgt 
cgtacaaaac ggcggtacac ccgctgctgc 
cgaggttcag acctccccgg accgccaaaa 
cggaattccg tacaccagca ataacctcaa 
tctccaactc ccttacagca tcagcccctc 
atgagctc 



<220> 

<221> misc_feature 
<222> (1015) (1015) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (1098) . . (1098) 

<223> n is a, c, g, or t 

<220> 

<221> mis cofeature 

<222> (1158) (1158) 

<223> n is a, c, g, or t 

<400> 17 

agctgaacac cccaaagatg gccaaccaca ccrytacctt tctccctaaa ctatccattg 
aagctatkca gacagtgact ccgatgagga taactgaacc acgacagaca cgacaagtat 
tggcagggga gcttgtagga cctgggattt tccaaaggtg tttgaacgtg gtccagtatt 
acatgaagga gaaagaagaa gactctggtt ggttattggc tgggtggatc aaggaaacac 
ttgggagagc tttacatgag caaccaatga tttctggtcg tcttcggaaa ggggaacgaa 
acgatggaga attggagatt gtttccaatg attgcggcat tagactcatt gaggcaagga 
ttcagatgaa tttgtcggat tttcttgatt tgaaacaaag ggaagatgct gaagctcagc 
ttgttttctg gaaagatatt gatgaacaaa acccacagtt ctcccccctc ttttatgttc 
aggttactaa tttccagtgt ggtggatatt caattgggat tagctgcagt attcttctag 
♦cagatctttt gttaatgaaa gagttcctta agacatgggc agatattcac aacaaggtta 
ttatcaacaa aaacgatgaa caaaagcttc ctttattota ccttcctggt ctgaaaaaca 
ccaatggtgc ctcccttaac atcatcacct caaattcaag caaaaactca gccaaaacca 
tgattttcca gatccatgct gaaactgaaa gtccagggag tgactggtgc aggaaaatgg 
cattagcctg tctggaggaa gccgagagca acctargaag tgttgtgggt ggagaatttt 
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ccttgtttgt gaacgaatcg tttgagtcca tcaaagttga aagctgctca aagcaaggga 900 

tgtcvaaaga agcagagatg ggagtcttga atcgtgcaaa atgggatgat ttgggggcta 960 

atgaagttag ttttggagat gggaataaac ctgcgcatgt ttcgtattgg cttanatcga 1020 

cgttgggtgg gcytgkcatt kgtattsctt sgcctsagga ggaaaatgca ctgtgaatat 1080 

cattggcaca gttcctgnca atgggagggg gcattgaact atcagctggg atggaaatgc 1140 

tatagaaaga aagagganat gctgatgatg ggtgcctttg ttgggccttg aatctttgga 1200 

cgttggcaag ctagaggtgc ttt 1223 

<210> 18 
<211> 1768 
<212> DNA 

<213> Gossypium hirsutum 
<400> 18 

ttatagtacc ggatactgcg cacgacacaa gccgaattca gcacgatcgt tgaaaaaata 60 

tgggtttcca aagaaacata ttgggtttcc ttttattgat attggcttca ctaacaagcc 120 

tctcttctag ccttcctagt gaatactcca tagtggaaca tgagattgac gcatttcttt 180 

cggaggaaag ggtgttggag atcttccaac agtggaaaga aaagaatcag aaagtgtacc 240 

ggcaagccga ggaggctgag aaaaggtttg aaaatttcaa ggggaatttg aagtatatcc 300 

tagagaggaa tgcaaagaga aaagcaaaca aatgggaaca ccatgtggga ttgaacaagt 360 

ttgctgatat gagcaatgag gagttcagaa aagcttactt gtcaaaggtg aaaaagccca 420 

tcaacaaagg gataaccctg tcaaggaaca tgaggagaaa ggtgcagtct tgtgatgcac 480 

cctcctcctt gaattggagg aactatggag ttgtgactgc tgtcaaggac caaggttctt 540 

gtggaagttg ttgggcattc tcatcaaccg gagccatgga aggaatcaat gccttagtta 600 

ctggagacct aattagcctt tcagaacaag aacttgtaga ttgtgatacc agcaactatg 660 
ggtgtgaagg aggatacatg gactatgctt tcgagtgggt tataaacaat ggcgggatcg - 720 

.atagcgaaac cgactacccc tacactggtg tggatggcac atgtaacacc accaaggagg 780 

aaaccaaggt tgtatctatt gatggctatc aagatgtaga gcaatcagat agtgctcttt 840 

tatgtgccgt tgctcagcaa cctgttagtg tgggaattga tggttccgcc attgattttc 900 

aactttacac tggtggaatt tatgatggga gctgctcgga tgatccagat gacattgatc 960 

atgctgtttt aatagttggt tatggttcag aaggcagtga agagtattgg atagtgaaga 1020 

attcatgggg aacaagttgg gggatagatg gatatttcta tctaaaaaga gacactgatt 1080 

taccatatgg tgtttgtgct gtcaatgcca tggcttctta tccaactaaa gaatcctctt 1140 

caccatcccc ttatccatcg ccaagtgttc ctccaccgcc acctccttca actccaccac 1200 

caccaccacc tccatctcct tcaccaagtg attgtggaga cttttcctat tgttcaagtg 1260 

atgagacatg ctgttgcctt tttgaattct atgattattg cctaatatac ggctgctgtg 1320 

aatatgaaaa tgctgtttgc tgtaccggaa ctgaatactg ctgccctagt gattacccca 1380 

tttgtgatgt ccaagaagga ctctgcctca agaacgctgg agactatctg ggagtagcag 1440 

ctaggaagcg aaaggtggct aaacacaaat taccatggac taaaatagag gaaacagaga 1500 

taacatatca gcctctgcaa tggaaaagga acccctttgc tgcaatgcgt tgaaaaaagt 1560 

gaaaaattac atatcatctc ttaaaccttg aaggttgttt tcaccttttt tctttttctt 1620 

tcatttttgc tttttcattt gcagcaagca aatccatgca gataagacta agaaaggggc 1680 
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atatttgttt agatgatgca tttgaatttg gaaactgtgt ttgtcattct tcaccagtgg 1740 

ggtataaaaa ctactatgct tttgttta 1768 

<210> 19 
<211> 1027 
<212> DNA 

<213> Gossypium hirsutum 
<400> 19 

tctagatgaa ggttctctcc ccaattcttg cttgcctagc gcttgctgtg gtggtaagcc 60 

atgctgctct ctcacctgag caatattgga gctataagct gccaaatact ccaatgccaa 120 

aggctgtcaa agaaattcta catccagaac tgatggagga gaaaagcacc tctgtaaatg 180 

taggaggtgg tggtgtaaac gtcaatacag gaaaagggaa gcctgggggt gacacccatg 240 

tgaacgttgg aggcaaagga gttggagtga acacgggaaa gccagggggt ggcactcatg 300 

tgaatgttgg agaccctttt aattacctat atgcagccag tgaaactcaa atccatgaag 3 60 

acccgaatgt ggctcttttc tttctggaaa aggatatgca ccccggggca acaatgagcc 420 

tgcatttcac tgaaaataca gagaaatcag ctttcttacc ttatcaaact gcccaaaaaa 480 

taccgttttc atctgacaag ttgccagaaa ttttcaacaa gttttcagtg aaacctggat 540 

cagtgaaggc agagatgatg aagaacacaa ttaaggagtg cgaacagcca gcgattgaag 600 

gagaggaaaa atattgtgca acctcactgg agtcaatgat tgactatagc atttccaaac 660 

tagggaaagt tgatcaggca gtctcaacag aagtggaaaa acaaacccca atgcaaaagt 720 

atacaatagc agctggagtg cagaagatga cagatgacaa agctgtagtg tgccacaagc 780 

agaattatgc atatgctgtc ttctattgcc ataaatcaga aacaacaagg gcttacatgg 840 

ttcctttaga gggtgctgac ggaacaaaag ccaaagcagt agcagtctgc cacacagata 900 

catcagcatg gaaccctaag catttggctt ttcaagtcct aaaagttgaa ccaggaacca 960 
— ttcctg-tctg- -cca-tttect-t cctcgggatc acattgtttg -ggtccctaag taaaaatcct ~ 1020 

ggagctc 1027 

<210> 20 
<211> 942 
<212> DNA 

<213> Gossypium hirsutum 
<400> 20 



cccgggcata 


cagagatgga 


gaggcaaaga 


agcaagcagg 


tttgtttgtt gatgtgggtt 


60 


ttggttgctg 


cctttttctc 


ccacaatagg 


gtcattgcag 


tgacctccac tggccttggfc 


120 


gagcagaaaa 


actactatcc 


agctcctgac 


cctcatgctg 


gaactccccc ttcaggttca 


180 


catggcacac 


caccatcttc 


aggaggtgga 


tcacctccct 


ctcatggaac cccgtcacat 


240 


ggaggtggtt 


accacccttc 


accaacacca 


tcaacgcctt 


cgggtggaaa ttgtggaact 


300 


cccccacatg 


acccttcaac 


tccatcaaca 


ccatcacaca 


ctcctcctca tggtactcca 


360 


ccatcatctg 


gaggtggtag 


tcccccatcg 


tatggaggag 


gcagtccccc atcgtatgga 


420 


ggaggcagtc 


ccccatcgta 


tggaggaggc 


agtcccccat 


catacggagg tggcagtccc 


480 


ccatcatatg 


gaggtggcag 


tccaccaact 


actcccattg 


atccaggaac tccaagcatt 


540 


ccctcacctc 


cattctttcc 


tgctccaact 


ccaccaattg 


gtggtacatg cgatttctgg 


600 


aggagtcacc 


ccacactgat 


atggggtctg 


cttggttggt 


ggggcactgt aggcaacgca 


660 
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tttggcgtga ccaacgctcc tggacttgga acaagcatga gcttgcccca agcactttca 



720 



aacacacgta ctgatggact tggggcgctt taccgggaag gaacagcctc atttctcaac 



780 



tccatggtga ataataggtt cccattctcg actaagcaag tcagggagac ttttgttgca 



840 



gcacttggtt caaacagcgc tgcagcagct caggctcgtc tcttcaagct tgccaatgaa 



900 



ggccacctca agccaaggac ctaagcaaac acctttgagc tc 



942 



<210> 21 
<211> 627 
<212> DNA 

<213> Gossypium hirsutum 
<400> 21 

cccgggagaa agatgatgaa aaggggtttt attgttttgg ccttgacggt ggttttcgcc 60 

gcgacggtgg ttacggcggc tgacgagagt gggttagcga atgagtgcag caaagatttc 120 

cagagcgtga tgacttgctt aagctttgct caaggaaaag cagcgtcgcc gtcgaaggag 180 

tgttgtaatt cagtggcggg gattaaagag aataaaccca aatgtttgtg ttatattttg 240 

caacaaacac aaacttccgg tgctcaaaat ctcaaaagct taggtgttca agaagataag 300 

ctgtttcagt taccgtcggc ttgtcaattg aagaacgcta gcgtcagtga ttgcccaaag 360 

cttcttgggt tatctccgag ctcaccagac gccgccatct tcaccaactc ctcctctaaa 420 

gcaacgacac ccagtacttc aacaaccacc gcaacgccgt cttccgcggc cgataaaacc 480 

gatagcaaat ccagtggaat caagcttggt ccccacttcg tcggttccac ggcggcgcta 540 

ctggttgcta cagcggccgt gtttttcctt gtattcccag ctggatttgc ttcaatagtt 600 

taggggtttt gcatgggatt tgatatc 627 

<210> 22 

<211> 2012 

<212> DNA 

<213> Gossypium hirsutum 



<400> "22" 
cccgggcggc 


aacttaaaag 


aaaacctttt ctttcctcat tgttttacta ctaaaatccc 


60 


ataatgccgg 


tcgtggattt 


tgtctgtgtt tttttagttt cagttgtgat gtttaatttg 


120 


agagtaagca 


cagaaccagt 


tgaagacaag caagctctac ttgctttcat ttcgggaatt 


180 


agacatgccg 


accgggttaa 


atggaattca tcgacctcag cctgtgattg gttcggtgtt 


240 


caatgcgacg 


ccaaccggtc 


tttcgtctac actttacggc tccccggtgc ggcccttatc 


300 


ggttcgattc 


cgcccaatac 


aatcggtcgg ttgaaccgac ttcgagtttt aagtctacga 


360 


gcaaaccgtt 


tgtccggtga 


gatccctgcc gatttctaca atttgactca gctgcgtagc 


420 


ctttatttgc 


aaggtaacga 


gttcaccggt ccgttcccac ctagtgtgac tcgtttaact 


480 


cgtttgactc 


gccttgatct 


ttcttctaat aatttcaccg gtccaattcc tttgggtgtc 


540 


aacaatttga 


ctcagttgac 


cagactcttc ttgcaaaata acaagttttc cggttctctc 


600 


ccgagcatcg 


actcggacgg 


tttaaacgat ttcaatgtgt ctaacaacaa ccttaaaggt 


660 


tcaatccccg 


actcgttatc 


taaattcccc gaatcttcat tcgccggaaa cattgggctt 


720 


tgcggcggtc 


cacttcggcc 


atgtaaccca tttcctccat ctccatctcc gactgagccc 


780 


attccgccca 


aaacttccgg 


tcaaagctcg aaaagccttc ccaccggcgc catcattgcc 


840 


attgccgtgg 


ggtcagcaat 


tgttgcgtta ctgttattac tattcctcat tatctgcttc 


900 


cgtaaatgga 


aacggaagtc 


accgaggcgg cagaaggcga taccatcgac gacacatgca 


960 
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gttccggtgg 


aggaggcggg 


gacttcctcg 


tcgaaagatg atataaccgg aggctcaacg 


1020 


gaaatcgaaa 


ggatgatgaa 


taataagctc 


atgttcttca 


aaggtggcgt 


ttacagtttc 


1080 


gatttggagg 


atttgatgag 


ggcgtcggct 


gaaatgttgg gaaaaggcag caccggaacg 


1140 


tcgtacaggg 


tggttttagc 


ggtggggacg 


acggtggcag 


ttaaacggtt 


gaaagacgtg 


1200 


gcggttagta 


aacgagagtt 


cgtaatgaag 


atgggaatgt 


tgggtaaaat 


catgcatgaa 


1260 


aacgtggttc 


cgttgagagc 


tttttattat 


tccgacgagg agaaattgct ggtttatgat 


1320 


tacatgcatg 


gtggaagctt 


gtttgcgctg 


cttcacggta gcagaagctc ggctcgtaca 


1380 


ccgttagaat 


gggacccccg 


gatgaaaata 


gccctaggcg 


tggctagagg 


cctcgcgcac 


1440 


ctccacagtt 


cacaaaacat 


ggtccacggc 


aacattaaat 


cttccaacat 


ccttctccga 


1500 


ccagaccacg 


aagcctgcat 


ctcagagttc 


ggtcttaact 


ctcttttcaa 


caccaacact 


1560 


ccacctagtc 


gcatcgcggg 


ttaccaagca 


cctgaagtaa 


ttcaaaccca 


taaagttacg 


1620 


qtgaagtcag 


acgtttatag 


tttcggggtg 


ttattattgg 


aattattaac 


cggtagggca 


1680 


ccaatccaac 


catcaataac 


tgaagaagcg 


ttcgatcttc 


cgcgttgggt 


ccaatccgtg 


1740 


gttcgggagg 


aatigggcggc 


cgdyy uyLtT- 


gacgcagagt 


taatggcata 


ccacgacatc 


1800 


gaggaagaaa 


tggtgcaagc 


gttacaaatt 


gcaatggttt 


gtgtctcgac 


agtgcccgat 


1860 


caaagacccg 


tcatgtcgga 


agtggtacgt 


atgatcggag 


atatgataga 


tagaggggga 


1920 


acaaatgacg 


gtacggcagc 


cgccatatga 


tccgttaaac aatgaaacaa aattcaaagt 


1980 


gtgttgacct 


tgagaagcca 


acaaaggagc 


tc 






2012 



<210> 23 

<211> 2013 

<212> DNA 

<213> Gossypium hirsutum 

<400> 23 



tactccaagc 


aagtattttc 


cttacacgtt 


tgtttttctt gtgattaatc gatatggcta 


60 


gctcaatgtc 


ccttaaactt 


gcatgtgtgg 


tggtgttgtg catggtagtg ggtgcacccc 


120 


tggctcaagg 


ggccgtaacc 


tgtggtcaag 


tcacctcgtg ccgaattcgg cacgagcttt 


.180 


ataaaagagt 


wcccaaagct 


gacaagcagc 


cataatatca tctgctaaac atatggcaga 


240 


aatgtcaacc 


ctttgtacat 


ttcttttctc 


acttctactc tttgcctcac atccccttat 


300 


cctccccact 


gctgccgacg 


gccggtggca 


gctgctacaa aaaagcattg gcatctcatc 


360 


catgcacatg 


caactcctta 


aaaatgaccg 


tgttgttatg tatgatagga ctgatttcgg 


420 


cccatccact 


ctgccattgg 


caagtgggaa 


atgccacaat gacccaacca acaccgctgt 


480 


ccaagtagat 


tgtacggcgc 


attcagttga 


gtatgatgtt ttgagcaaca agttcagggc 


540 


tcttactgtc 


caaagcaacg 


tttggtgttc 


ttcaggcggc gtcatgcctg acggtaaatt 


600 


ggtccaaacc 


ggtggtttca 


gcgaaggaga 


acttcgggtc agggttttca gcccatgcga 


660 


aagctgcgac 


tggcacgaaa 


caccaaatgg 


attggcggcc aaaagatggt atgctaccaa 


720 


tcatgtcttg 


ccagatggaa 


gacaaattgt 


tgtcggcggc cgagaacaat ttaattacga 


780 


gtttgttcct 


aaaaacatag 


ccgccgacac 


gttcaagttg catttcctgt cggaaaccaa 


840 


tgaacgagga 


gtagagaaca 


atctctaccc 


ttttgttttt ctcaatgtcg atggaaacct 


900 


gttcattttc 


gccaacaatc 


gagctatttt • gcttgattat gtgaacaaca aggtggtgaa 


960 


aacttacccc 


aaaataccag 


gtggggagcc aagaagctat ccaagcacag gttcggctgt 


1020 
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attgctacca 


ttgaagaact 


tgacagccgc 


cactattcaa gctgaagttt tagtttgtgg 


1080 


gggtgctcca 


aaaggatcat 


ttgtccaagc 


attacaaggt aagttcgtta 


aagccttgaa 


1140 


cacttgcgcc 


aggatctcaa 


taaccgaccc 


gaaaccaaaa tgggtcttgg 


aaactatgee 


1200 


tttagctaga 


gtcatgggtg 


acatggtatt 


gcttccaaac ggcaaagttt 


tggtcatcaa 


1260 


cggagcacgt 


tccgggtcag 


caggatggga 


cttaggaagg gacccggtct 


taaatccagt 


1320 


gttatacatg 


cctgataatg 


aaatcgagtc 


acgattcaag atactaaacc 


caacaaagat 


1380 


ccctcggatg 


taccattcca 


cagcagtatt 


acttcgtgat ggaagagttt 


tagtgggtgg 


1440 


aagcaatcct 


catgcgtatt 


acaactttac 


gggagtcctt taccctactg 


aactaagect 


1500 


ggaggcattc 


tatccgggtt 


atttggacgc 


caaattcaac aatttacgac ccaccattgt 


1560 


tgctcccaag 


tcaatgtccg 


gaatcagata 


caataagaaa ttaaaaatta 


aagtggtgat 


1620 


tacaggtgaa 


gtaactctaa 


acttgttgtc 


ggtgacaatg gtgtcaccag 


cttttaatac 


1680 


ccattccttc 


tctatgaatc 


aaaggttgct 


tgtacttgga aacgacaaag ttatggcatc 


1740 


cgggaaatca 


acgtatgaga 


ttgaagtgat 


gacgccaggt tcgggtaacc 


tcgcacctgc 


1800 


aggcttttat 


cttttgtttg 


tggttcatca 


agacatcccc agccaaggca 


tttgggtcca 


1860 


tttgaaatag 


tttttgattg 


atatgtttgt 


gaaattttga tcattattta 


gagaaagaaa 


1920 


tgtttatttc 


aacaatgtgg 


taaaattgtc 


ccctacatta agcaaatgta 


tttacaaatt 


1980 


tgtatcaata 


aaagaggata 


attgtttccg 


teg 




2013 



<210> 24 

<211> 1566 

<212> DNA 

<213> Gossypium hirsutum 



<400> 24 
cccgggactg 


taaaaaagca 


taggttccca 


atgeagatec tcccgtttcg 


aggaggegea 


60 


_cttg.tgtgct 


teattgette 


cttgttattt 


g.ttg.ccagtt .tttgcaatgc__cgatgctaaa . 


120 


acagtcgagg 


ttgtcggggc 


tggtgagtgt 


gcagattgtg cagagaataa 


cttggagatt 


180 


agecaggett 


tttcagggct 


acgagtaagc 


atagactgea ageccgaaaa 


tgggaagaac 


240 


tttaaaacac 


gaggatcagg 


ggagcttgat 


aaacaaggca attttaaagt 


attegttcca 


300 


gaggatttgg 


ttgaaaatgg 


ggaactgaag 


gaagaatget atgeacaget 


tcacagtgta 


360 


tcggcagcac 


cttgtcccgc 


ccatgatggc 


ttggagtcgg ccaagttagt 


gttgaagtcc 


420 


aggagtgatg 


ggaaacaegg 


gtttgggtta 


aaaggaaagc tcagattctc 


acctctaacc 


480 


tgtgcttctg 


ccttcttttg 


gcctcacttt 


aagtttcctc ccttgcctaa 


gtggaaccat 


540 


ccccctttgc 


ccaagtttcc 


acttccccca 


ttcaaaggct tccaccacca 


ttatccaata 


600 


atccctccca 


tctacaagaa 


acctcttccg 


ccaccgtccc cggtgtacaa 


gccccctcca 


660 


gttcccgtaa 


acccacctgt 


tccaatctat 


aaacctccac" cagttccagt 


ctataaacct 


720 


cctccagttc 


cagtaaaacc 


acttcctcca 


cctgttccaa tctacaaacc 


tccaccagtt 


780 


gaaaaaccac 


atcctccacc 


tgttccagtc 


tataaacctc caccagttcc 


agtatacaag 


840 


aagccatgtc 


ctccaccagt 


tccagtctat 


aaatctcctc cggttccagt 


atacaagaaa 


900 


ccgcatcctc 


ctccagttcc 


agtctataag 


aaaccacatc cacctccagt 


tccagtatac 


960 


aagaaaccat 


gtcctccccc 


agttccagtc 


tataaatctc ctccagttcc 


ggaaccacat 


1020 


cctccgccag 


ttccagtcta 


taagaaacca 


catccacctc cagttccagt 


atacaagaaa 


1080 
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CCaty v-i— 




tctcctccag 


ttccggaacc 


acatcctccg 


1140 




tccataagcc tcctccagtt 


ccggtataca 


agaaaagagt 


ccctcctccg 


1200 




acaagccccc tccagfctcct 


gta tacaaca 


aaccactacc 


tcccccggtt 


1260 


(J ci y Ly La La 




gttccaacct 


acaaaccaaa 


acccctccct 


1320 


cccattcctt 


acaagccact ccctccactt 


cccaagatcc 


ctccattccc 


taagaagcca 


1380 


tgccctcccc 


ttcctaagct acctcctctt 


cccaagattc 


ctcccaagta 


tttccaccac 


1440 


caccctcccc 


ttcctaagct acctcctctc 


cctaagattc 


ctcccaagta 


tttccaccac 


1500 


catcccaagt 


tcggaaaatg gccttctttg 


ccaccctttg 


ctccccatca 


tccttaagct 


1560 


gagctc 










1566 



<210> 25 

<211> 689 

<212> DNA 

<213> Gossypium hirsuturo 



<400> 25 



cccggggatg 


gtgagaggca 


aaattcagat 


gaagcgaatt 


gaaaatgcaa 


cgagccggca 


60 


agtcaccttc 


tctaagcgac 


gaaacgggtt 


gttgaagaag 


gcttatgaac 


tatatgttct 


120 


atgcgatgct 


gaagtagctg 


tcatcatatt 


ttctcataaa 


ggaaaactct 


acgagttctc 


180 


aagttccgac 


aacatgcaaa 


acaccataga 


acgataccgt 


cagtacaaga 


aagatgtcca 


240 


gagtaacatc 


cctgaatttg 


acagatacac 


acagcaacta 


aggcttgaag 


cagaaaatat 


300 


ggccaagaag 


attgagttcc 


ttgaggtttc 


taaaaggaga 


atgttgggtc 


aaaatcttgg 


360 


ttcttgttct 


atagatgaac 


ttcaagaggt 


tgaaaaccag 


cttgaacgca 


gcttaagaaa 


420 


cattagggca 


agaaagggct 


atttattcaa 


ggagcagata 


ctgcaactaa 


aagctaagga 


480 


aagatatatg 


caagaggaga 


atgccaagtt 


atctgctaag 


aacaatggta 


caacatgcag 


540 


ccajjcagaac 


gcggaggtgg 


agacagaact 


gttcctcggg 


ttgcccgaaa 


accgctgttc 


600 


ctagcaggta 


ggtctttgga 


tatggaatga 


aaatgatatt 


ccctattgga 


ataatgcttg 


660 


cttgtacgtt 


atcgccattg 


ctagagctc 








689 



<210> 26 

<211> 258 

<212> PRT 

<213> Gossypium hirsutum 

<400> 26 

Met Ala Thr Lys Thr Met Met Leu Gin He Phe Pro Leu Phe Phe Phe 
1.5 10 15 

Leu Phe Ser Val Cys Asn Ser He Phe Leu Gly Ala Asn Gly Asp Asp 
20 25 30 

Asn Gly Gly Trp Gin Thr Ala His Ala Thr Phe Tyr Gly Gly Ala Asp 
35 40 45 

Ala Thr Gly Thr Met Gly Gly Ala Cys Gly Tyr Gly Asn Leu Tyr Ser 
50 55 60 

Gin Gly Tyr Gly Thr Ser Thr Ala Ala Leu Ser Thr Ala Leu Phe Asn 
65 70 75 80 
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Asn Gly Leu Ser Cys Gly Ala Cys Tyr Glu Leu Arg Cys Asn Asn Asp 
85 90 95 

Pro Gin Trp Cys lie Ser Arg Thr lie Thr Val Thr Ala Thr Asn Phe 
100 105 110 

Cys Pro Pro Asn Tyr Ala Leu Ser Ser Asp Asn Gly Gly Trp Cys Asn 
115 120 125 

Pro Pro Arg Glu His Phe Asp Leu Ala Glu Pro Ala Phe Leu Arg lie 
130 135 140 

Ala Glu Tyr Arg Ala Gly lie Val Pro Val Met Phe Arg Arg Val Ser 
145 "* 150 155 160 

Cys Val Lys Lys Gly Gly lie Arg Tyr Thr Met Asn Gly His Ser Tyr 
165 170 175 

Phe Asn Met Val Leu lie Thr Asn Val Gly Gly Ala Gly Asp lie Thr 
180 185 190 

Ser Val Ser lie Lys Gly Ser Arg Thr Gly Trp Leu Pro Met Ser Arg 
195 200 205 

Asn Trp Gly Gin Asn Trp Gin Ser Asn Ala Tyr Leu Asn Gly Gin Ser 
210 215 220 

Leu Ser Phe Lys Val Thr Ala Ser Asp Gly Arg Thr lie Thr Ala Tyr 
225 230 235 240 

Asn Val Val Pro Ala Gly Trp Gin Phe Gly Gin Thr Phe Glu Gly Gly 



<210> 27 
<211> 1432 
<212> DNA 

<213> Gossypium hirsutum 
<400> 27 

cccgggttcc atgaaaaggg tctcgaacat ctatatagag gcaaagaaac atctctctga 60 

tctgctaaca aacgaggatc aaaatgagga tcttttgagt acacaggttc caaaaacctt 120 

aggtaggatt ctttctcttc ctgagtataa tacatcccct gtcagcagcc ctggtcagaa 180 

cttggagcat agttttacaa ctgcgcatat gagatttgca ggctcggaca aattgcaaat 240 

ggtgagtgaa aatgatcggt ttgtcagcct tctaagtatg agggcagaga agaccgatgg 300 

ccagctttgc atttctgaaa acaaaagcga taatgaagtt gaaagtgata atgcaatttc 360 

aaacaacctt gacactagtg tgaataatga caaagaggat ccaatttttt gttctataaa 420 

agatgaattg agttccaaag agtctgtgag tattgttaaa gctactgaaa tgatggttca 480 

tgaagaaagc aagtccctgg atatttcttc agagacgagc ggctcttcaa ttatcacaga 540 

tgataaaaat gttgacatat atgaagtttg tgatgaaaaa caaaatcctt ggtacttgaa 600 

acaggattca tcggaagtigg accaacagcc tttttctcca ttatcatctc catcagactc 660 
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atcagtcatg 


aaaaaggttg 


aatgtttgga 


gagtgttact 


ccccgtatct 


gttcttgagc 


caatatttgc 


agatgatctt 


atcttattcc 


ggtgaaacat 


ccattcaacc 


gctaagaatt 


tttggccaca 


aatcagagca 


atcgaattaa 


aacttgtatg 


tgagcacata 


aaggcagtgc 


tgcaagcctc 


gagtttcagc 


gtcactttct 


tcagacctgc 


ttatcgaccc 


attgttggtt 


caaccagctt 


tgtcaagacc 


aaaaactgct 


ctttgattgc 


ggtttgtgag 


tactattttg 


gttcccctag 


tgtttcattt 


tatcccaaac 


atgcaaaata 


caattcaaga 


agtctgggag 


cccgactcca 


ttgccttgta 


ctctggacct 


ggtagtccga 


aacatggatg 


gaccttcaac 


ttgacactgg 


ttatattggt 


cttcgaagat 


ttagtggaag 


acaccataac 


aagctacata 


atataatgtg 


cttccagctt 


agcttagctg 


aaggacaacg 



<210> 28 

<211> 1079 

<212> DNA 

<213> Gossypium hirsutum 



<400> 28 
ggattccggt 


gacggtgttt 


ctcacaccgt 


gccaatctat 


tgccatcctc 


cgtcttgacc 


ttgcaggtcg 


tgatctaacc 


taccgagaga 


ggttacatgt 


tcaccaccac 


tgctgaacgg 


ggagaagctt 


gcttatgttg 


ccctggacta 


tgagcaggaa 


ctcatctgtt 


gagaagaact 


atgagttgcc 


tgacggacaa 


gagattccgt 


tgcccggaag 


tcctcttcca 


gccatctttc 


aatccatgaa 


actacctaca 


actctatcat 


gaagtgcgat 


ctacggtaac 


attgtgctca 


gtgggggttc 


aaccatgttc 


gagcaaggag 


atcactgctc 


ttgctccaag 


cagcatgaag 


agagagaaag 


tacagtgtct 


ggattggagg 


atctatcttg 


gcagatgtgg 


atttccaagg 


gtgagtatga 


tgaatccggt 


gtgcttctaa 


gttttgtaat 


tgcttttgat 


ggtgatctac 


tttttgtgtg 


cgatgttaag 


tgaagtccaa 


agtctggttt 


cattgtagga 


tggtgtactt 


gatattgacg 


tattattatt 


caccattaag 


atgatgggtc 


ctatggagat 


ggcggtgggc 


cttccttata 


atccatcttt 


gaaccatgtt 


gcttaaaagg 


gattgtggtg 


cttttttatt 


ttattttatt 


atttaatatt 


atgttcatag 


ctattattgt 


acgagatttt 


ttttgaaaaa 



<210> 29 
<211> 657 
<212> DNA 

<213> Gossypium hirsutum 
<400> 2$ 

cggccgcgtc gactttttta ataaaactga aacaaccctt 



PCT/H.2005/000627 



gatataccag 


agcgatcaag 


720 


atcagccctg 


caagcatcag 


780 


cgattcgaag 


aacatgactc 


840 


aatgataagg 


aatcaatatt 


900 


tgggacgaag 


tctacatccg 


960 


gacgaggtcg 


aatacttgcc 


1020 


attaatgaag 


tagtcagaga 


1080 


gttaaaccca 


atatccgtcc 


1140 


ggagtttatt 


ggcatttgct 


1200 


aaagacctgg 


ctaagactgg 


12 60 


gttgagattg 


gtgaagccat 


1320 


aatggaagtt 


gggaatgtga 


1380 


acgatagagc 


tc 


1432 


gaaggatatg 


cccttccaca 


60 


gatgccttga 


tgaagattct 


120 


gaaattgtcc 


gtgacatgaa 


180 


ctggagactg 


ccaagagcag 


240 


gtcattacta 


ttggagctga 


300 


atcgggatgg 


aagctgctgg 


360 


gtggatatca 


ggaaggatct 


420 


cctggtattg 


cagaccgcat 


480 


attaaggtcg 


ttgcgccacc 


540 


gcatcactca 


gcaccttcca 


600 


ccatccattg 


tccacaggaa 


660 


attttgcatt 


tagttggctt 


720 


atgtggggag 


agttagggat 


780 


ttagcctttc 


accgtatcac 


840 


ggacaattgg 


tgcttaattc 


900 


atgtttggag 


ctggagactg 


960 


caagggtttt 


gagaacatta 


1020 


ttagagtcag 


tttgcggtc 


1079 


cttcacacaa 


aaatatgctc 


60 
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cgtacaattc 


gtttattctc 


tcatagcatc 


tcttctaaat 


tacacaaagc 


acaatcactc 


actcacttaa 


gccttagaca 


acacgggaac 


tgtaacccca 


ttcgttgtca 


taccaagtca 


ttcaaagcaa 


ttccagcctt 


ggcatcaaaa 


atgcttgacc 


gttgagacta 


aatcttcgtc 


cacataacca 


agaattccct 


gccttgatag 


cagccttaat 


ttcatcatat 


gtagccttct 


tcaaccacag 


agacatcaac 


agtgggaaca 


cggaaagcca 


agtgctggca 


aaactttgcc 


gacggccttg 


gcagctccag 


aaggaagcag 


ctctaccacc 


tctccagtcc 


ttcatggaat 


gtagcagtaa 


tagaatgaac 


agtggtcata 


agaccctcaa 



<210> 30 

<211> 649 

<212> DNA 

<213> Gossypium hirsutum 



<220> 

<221> misc_feature 

<222> (423) . . (423) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (4.92) . . (492) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (504) . . (504) 

<223> n is a, c, g, or t 

<220> 

-<221>. misc^feature .... - 

<222> (627) . . (627) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (630) . . (630) 

<223> n is a, c, g, or t 

<400> 30 

gagaaaagga agacaacgat ggtgtctctg aagttacaga 
ctgaagtgtg gccgtggcaa ggtttggctt gatcctaatg 
gccaactcca ggcagaatgt taggaaactt gttaaggatg 
accaagattc actcccgatc tcgtgcacgc agaatgaaag 
cattctggct atggtaagag gaagggtacc agggaggcaa 
tggatgagga gaatgcgagt actaaggcgt ttgcttcgta 
attgacaagc acatgtacca tgacatgtac atgaaggtga 
aancgtgtct tgatggaaag catccacaag tccaaggctg 
ctctcaaatc antttgaggc caancgaact aaaaacaagg 
gccagaaagg aaaaacgcct tgcacaggga cctggtgtga 
ccgcaacagg ccgaaggagt taaaaantcn aagaaatgaa 



PCT7IL2005/000627 



taaaattgct 


aaagcttgga 


120 


tgtgtcggat 


caagtcgata 


180 


caagcttaac 


aaagttgtca 


240 


tgttgtctcc 


gatgaagtca 


300 


tcaagttgcc 


ttcagattcc 


360 


tctcaagtct 


cacagtgagg 


420 


ttccagtcag 


cttgccattc 


480 


tgctgctagg 


aatgatattg 


540 


gaccatcaac 


agtcttttgt 


600 


cgatgccaaa 


tttatca 


657 


agcggctcgc 


cgctagcgtc 


60 


aaatcaatga 


aatctccatg 


120 


gttttatcat 


ccggaagcct 


180 


aggccaagag 


aaagggtcgt 


240 


gattgcctac 


aaagatcctt 


300 


agtacaggga 


atccaagaag 


360 


agggtaatgt 


atttaaaaac 


420 


agaaggcaaa 


aaaaaaaaca 


480 


cgagcaggga 


gagaaagatg 


540 


aagcagcacc 


tgcagctgca 


600 


tgaggtact 




649 
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<210> 31 

<211> 877 

<212> DNA 

<213> Cauliflower mosaic virus 

<400> 31 



aagcttgcat 


gcctgcaggt 


ccccagatta 


gccttttcaa tttcagaaag aatgctaacc 


60 


cacagatggt 


tagagaggct 


tacgcagcag 


gtctcatcaa gacgatctac ccgagcaata 


120 


atctccagga 


aatcaaatac 


cttcccaaga 


aggttaaaga tgcagtcaaa agattcagga 


180 


ctaactgcat 


caagaacaca 


gagaaagata 


tatttctcaa gatcagaagt actattccag 


240 


tatggacgat 


tcaaggcttg 


cttcacaaac 


caaggcaagt aatagagatt ggagtctcta 


300 


aaaaggtagt 


tcccactgaa 


tcaaaggcca 


tggagtcaaa gattcaaata gaggacctaa 


360 


cagaactcgc 


cgtaaagact 


ggcgaacagt 


tcatacagag tctcttacga ctcaatgaca 


420 


agaagaaaat 


cttcgtcaac 


atggtggagc 


acgacacact tgtctactcc aaaaatatca 


480 


aagatacagt 


ctcagaagac 


caaagggcaa 


ttgagacttt tcaacaaagg gtaatatccg 


540 


gaaacctcct 


cggattccat 


tgcccagcta 


tctgtcactt tattgtgaag atagtggaaa 


600 


aggaaggtgg 


ctcctacaaa 


tgccatcatt 


gcgataaagg aaaggccatc gttgaagatg 


660 


cctctgccga 


cagtggtccc 


aaagatggac 


ccccacccac gaggagcatc gtggaaaaag 


720 


aagacgttcc 


aaccacgtct 


tcaaagcaag 


tggattgatg tgatatctcc actgacgtaa 


780 


gggatgacgc 


acaatcccac 


tatccttcgc 


aagacccttc ctctatataa ggaagttcat 


840 


ttcatttgga 


gagaacacgg 


gggactctag 


aggatcc 


877 



<210> 32 

<211> 31 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 32 

acccgggatg gatggttatt gtagcagaag g 



<210> 33 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 33 

gccgagctcg aatcaaatga gggcaatgcc 



<210> 34 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 34 

aatctagaca agtacagaag ctcaattccc 30 



<210> 35 

<211> 22 

<212> DNA 

<213> Artificial sequence 



20 



WO 2005/121364 



PCT/IL2005/000627 



<220> 

<223> Single strand DNA oligonucleotide 



<400> 35 

tgataatcat gtggaagcaa cc 



22 



<210> 36 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 36 

cagcccgggt gatggaactg agcattcag 29 



<210> 37 

<211> 31 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



<210> 38 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 38 

tttcccgggt tgttgtcatg gcttctctgc 30 



-<210>- -39 - - 

<211> 28 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



<210> 40 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 40 

gcacccggga aaggaaatgg caggcgtc 28 

<210> 41 

<2U> 31 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



<400> 37 

cgtgagctct gattagagtt tcaagtgcat g 



31 



<400> 39 

atggagctca tattcatggc caaaacac 



28 



<400> 41 
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tttcgatatc cacagtaccc tacttccatg c 31 

<210> 42 

<211> 31 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 42 

tacccgggta ccattactct actacagctg c 31 

<210> 43 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 43 

gagagctcaa cagacaaaga ccagactgg 29 

<210> 44 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 44 

acccccgggc aagtgatcaa agagaatgg 29 

<210> 45 
<211> 29 
<212> DNA 

<213> Artificial sequence 

-<220> - - - - ' • - - — 

<223> Single strand DNA oligonucleotide 

<400> 45 

catgagctct ttctccaact cctctaccc 29 

<210> 46 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 46 

cccccgggtc cctattgcat gcctttc 27 

• <210> 47 

<211> 26 

<212> DNA 

• <213> Artificial sequence 

. <220>- 

<223> Single strand DNA oligonucleotide 
<400> 47 

ttgagctcac tcgatcttac tcatcc 26 

<210> 48 
<211> 29 
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<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 
<400> 48 

agcccgggag atagagagat gggaggtcc 29 



<210> 


49 


<211> 


28 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Single strand DNA oligonucleotide 


<400> 


49 


tcgagctctg gggcaacaat catttacc 


<210> 


50 


<211> 


30 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Single strand DNA oligonucleotide 



. <400> 50 
tccccgggca tctgatctaa ttgttggtgg 



<210> 51 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 51 

ttggatatcg caccttatga catgggatc 29 

<210> 52 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 52 

ttcccgggta caaacatggc tagttccg 28 



<210> 53 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 53 

tcgagctcat caacctcact gcaccttg 28 

<210> 54 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 
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<400> 54 

tagtcactcc tgttctagat gaag 24 

<210> 55 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 55 

ctgagctcca ggatttttac ttagggaccc 30 

<210> 56 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 56 

tacccgggca tacagagatg gagaggc 27 

<210> 57 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 57 

acgagctcaa aggtgtttgc ttaggtcc 28 

<210> 58 

. <211> 28 

<212> DNA 

-<-213> Artificial- sequence - 



<220> 

<223> Single strand DNA oligonucleotide 

<400> 58 

agcccgggag aaagatgatg aaaagggg 28 

<210> 59 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> "Single strand DNA oligonucleotide 

<400> 59 

aagatatcaa atcccatgca aaacccc 27 

<210> 60 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 60 

aacccgggcg gcaacttaaa agaaaacc 28 
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<210> 61 

<211> 26 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Single strand DNA oligonucleotide 
<400> 61 

aagagctcct ttgttggctt ctcaag 26 



<210> 62 

<211> 26 

<212> DNA 

<213> Artificial 



sequence 



<220> 

<223> Single strand DNA oligonucleotide 
<400> 62 

gacccgggac tgtaaaaaag catagg 26 



<210> 63 

<211> 28 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Single strand DNA oligonucleotide 
<400> 63 

gcgagctcag cttaaggatg atggggag 28 



<210> 64 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400>- -64 

atcccgggga tggtgagagg caaaattc 



28 



<210> 65 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 65 

acgagctcta gcaatggcga taacgtac 28 



<210> 66 

<211> 27 • 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 66 

atcccgggtt ccatgaaaag ggtctcg 27 



<210> 67 

<211> 28 

<212> DNA 

<213> Artificial 



sequence 
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<220> 

<223> Single strand DNA oligonucleotide 
<400> 67 

gtgagctcta tcgtcgttgt ccttcagc 28 



<210> 68 

<211> 21 

<212> DNA 

<213> Artificial 



sequence 



<220> 

<223> Single strand DNA oligonucleotide 
<400> 68 

tcgtgatcta accgatgcct t 21 



<210> 69 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



<400> 69 

acaatttccc gttcagcagt g 



21 



<210> 70 

<211> 22 

<212> DNA 

<213> Artificial 



sequence 



<220> 

<223> Single strand DNA oligonucleotide 
<400> 70 

ggtggctcct acaaatgcca tc 22 



<210> 71 

<211> 20 

<2i2> DNA" 

<213> Artificial sequence 



<220> 

<223> Single strand DNA oligonucleotide 
<400>- 71 

aagttgggta acgccagggt 20 



<210> 72 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



<400> 72 

ctattcggct atgactgggc 



20 



<210> 73 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



<400> 73 

atgtcctgat agcggtccgc 



20 
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<210> 74 

<211> 1344 

<212> DNA 

<213> Gossypium hirsutum 



<400> 74 
tgctcccttg 


ctacaccacc 


ctcggaggag 


gagcagcaga 


cccttaaagg 


gtttcgcagt 


60 


gatatttggg 


tccgtcgttt 


tcctactctc 


actggtcata 


ttaatcgtta 


accaaagccc 


120 


•ggagccatta 


gcaagtaacc 


ccagtagtgt 


aacggaggca 


gggtcgtatt 


caatggcggc 


180 


gcagccaaga 


gggatagctg 


aaggagtttc 


agccaagtca 


aacccatcac 


tttttgacaa 


240 


agttgggttt 


aattggacaa 


acgctatgtt 


ttactggcaa 


agaactgcct 


accactttca 


300 


gcctcaaaag 


aattggatga 


atggtaggtt 


acctaattat 


aatttaagtt 


acttcttttg 


360 


attttcgtca 


ctaaaccttt 


cattttagtt 


tactattttt 


aaacttaaat 


atgatttcct 


420 


tttaataatc 


gaatttaatt 


tgttgtcttt 


ttcttactat 


gcttgcacgt 


tggttcggca 


480 


caacttacgt 


atcttgcttc 


agatcctgac 


ggtgagttct 


catcacatct aaattcttgt 


540 


tgggacaata 


ctgttagtca 


accatttcat 


caatcaatgc gtaaaacaca aaaatatcga 


600 


atcagaaatt 


tgtgaccaac 


ccaatctgct 


agttcttcca 


aatttgagca 


tttcaacctt 


660 


gatttgcaat 


taaagttagc 


ttctacattg 


aattgaatca 


tatcttaccc 


tttttcttct 


720 


actagatcca 


cttataattt 


tatttttcaa 


tactcattta attaaagtaa ataatttaaa 


780 


taatttgttt 


catataaaat 


atatatattc 


tacatcaata 


agatactaat atcgaatcca 


840 


ccatttgtgg 


tataataaat 


gcaattatat 


tacaaaaaag 


ttaataaaat attagtagca 


900 


tagaattaat 


taatttaaaa 


aaatatgatt 


tttttagcag 


aattaaaaaa 


aacaaatatc 


960 


ttataaaaaa 


aataaatatt 


aaaagaaaaa 


agacatatga 


taacccttag 


tttacaatct 


1020 


ataagttaca 


aaaaaatagt 


tacttgaccg tttggtttgt 


ttacctgtcg ttctaacgtt 


1080 


-- taagtectaa 


etaaetagtt 


ttgcaaaacc 


-ttgettctgt- 


acatcaccat gtaatagcat 


1140- 


gtggtttttt 


tagtaattat attaaactct 


aatagtttaa 


ttaaagtagt 


atgtgacata 


1200 


atggaacaaa 


aatacgatgg tcgcaggtcc 


gttatatcac aagggatggt 


accatctttt 


1260 


ctatcaatac 


aaccctgatt cagccatatg gggcaacatc acttggggcc acgctgtatc 


1320 


aaaggacctc 


attcacggtt ctat 








1344 



<210> 75 
<211> 556 
<212> DNA 

<213> Gossypium hirsutum 
<400> 75 

tccacaacat tgacgtcagc cggcgccgcc gccgccacca tcaaactctc cctctctccc 60 
ttccctcacc cttcttcctc ccatccttac caaattctca acaacttagt cacttcttct 120 
gttgcaagag cccaccacct caaacayccc aaagccaagg ccgataatac tacctcttct 180 
cttctcaggg ctyccctatt tyctcacagt tatgggggct acactatctc cctcaaattt 240 
ggaactccgc ctcaaaccct tcctttcrty atggacacyg ggagcagcct ctcctggttc 300 
ccttgcacct ctcgttacct ttgttcccaa tgcgcwttcc ccaatgttga ccctgcaaaa 360 
atccccactt ttgcccctaa ackttcatct tccarkaagc tcgtaggttg yagaaacccc 420 
aagtgtagtt ggctttttgg ccccgacgtt gagtctcgtt gccaagactg tgaacccact 480 
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tccgaaaact gcactcaaac ytgccctcct tacataattc aatacggttt aggttccact 540 
gctgggcttc tattag 556 

<210> 76 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 76 

tttttttttt tgtttgttgt gggggtgt 28 

<210> 77 

<211> 17 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 77 

tttttgtttg ttgtggg 17 

<210> 78 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 78 

tttttttttt tgtttgttgt gggggtgt 28 

<210> 79 

<211> 20 

<212> DNA 

-<-213> "Artificial- sequence- - - - ■- 

<220> 

<223> Single strand DNA oligonucleotide 

<400> 79 

ctggggttac ttgctaatgg 20 

<210> 80 

<211> 17 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 80 

tttttgtttg ttgtggg 17 

<210> 81 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 81 

gctccgggct ttggttaacg '20 
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<210> 82 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 82 

ggctttggga tgtttgaggt gg 22 

<210> 83 

<211> 17 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 83 

tttttgtttg ttgtggg 17 

<210> 84 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 84 

ggtggtgggc tcttgcaaca g 21 

<210> 85 

<211> 969 

<212> DNA 

<213> Gossypium hirsutum 



<400> 85 
ggtggtgggc 


tcttgcaaca 


gaggcaattt 


tctaccggaa atgaaggaag gaatgaagga 


60 


-ggcggtagag 


agaagagcac 


agaaaatgtc 


-ttacagaaat- tgaaagggta. agacattttc 


. . 120 


cttaaaatgt 


aacacatttt 


ctcttgtttt 


ggagttcatt ttccaaatgg aaaatgtttt 


180 


ccgccaatca 


aaagctgaaa 


aagttaaaaa 


tgattttctg gaaaatcaat tccgtcaatc 


240 


aaacagaccc 


ttagtctatt 


ctctccaatt 


aaatcattct tagtccttat acttttttaa 


300 


atttctatct 


cgatacaaaa 


gacaaccatt 


gaatctatta aattaccttt gtgtaaatga 


360 


tatatgaaaa 


taataaattg 


atatgacata 


acgcatgcga taatatatgt aaaaatcacc 


420 


aattacaggt 


acaaaaaaat 


ggttatggac 


taaatccgta acttgcgcat gataaacgaa 


480 


gtggcataat 


ggataattca 


gtgttttaca 


atgtcaaaat agcagcaccg taatcgaaca 


540 


tgataccttg 


gtccagttgt 


gctgtttacc 


gttggtatag tatttctact ctctctctat 


600 


aaagagagaa 


cgggacaaac 


atcatcccca 


ccgctatgcc tattccccca • ctcaaattca 


660 


tttcactttt 


aaataccaat 


taatattact 


tacacttact tcccctttac aaatagataa 


720 


tccaaaagca 


gagcaaaaac 


agagataacc 


attctttttc ttttgttgtt gttgttgtcg 


780 


tcgtcatggc 


ttctcttcct 


ttcatcttct 


tcttatcttt ctttataatc tccacaacat 


840 


tgacgtcagc 


cggcgccgcc 


gccgccacca 


tcaaactctc cctctctccc ttccctcacc 


900 


cttcttcctc 


ccatccttac 


caaattctca 


acaacttagt cacttcttct gttgcaagag 


960 


cccaccacc 








969 
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<210> 86 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 86 

attcaagctt tttttgtttg ttgtggggg 



<210> 87 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 87 

ttggatcctt gggcattgag cttctgtac 



<210> 88 

<211> 1040 

<212> DNA 

<213> Gossypium hirsutum 

<400> 88 



aagctttttt 


tgtttgttgt 


gggggtgtta 


tttgaagtag 


aattgaagca 


gctcctttaa 


tttagttttg 


ttggttgtca 


gggtttgaat 


aatgcaatgc 


aattcaagac 


ccaatgatcc 


cactcaaatc 


ccaacaacca 


ttccatcctg 


atggatgtgg 


agcctctcaa 


acttagagct 


cttgccacag 


cacatgatgc 


aagtacaagg 


acaattaatc ctagattatc 


tcaacagcat 


tttcgtatac 


atatgtctgc catttaattt 


aaaggtaaac 


tgccttattc 


acctcacaaa- 


tcagtatcca 


iraaactagct 


aacatgctca 


agacttggca 


ttccctaatg 


ctgtgtgtcc 


tggctctgtc 


ttcatgcttc caaattatta 


ttattaatga 


gaaatcttgc 


aacgcaagcc 


acaacccaga 


agctagagaa 


tttataacta 


tatgtatagt 


agtgtaaatg 


gcaatatata 


aaaagcaagc 


aaatgagttt 


gactaccagg 


tgcagctgca 


cctacctttt 


caactgtccc 


tcttgtttca 


ctgtatagca 


gggaccacct 


ctctctccca 


gctaaattgg 


acaacaacca 


aatctcttct 


ctttctctct 


atgttgttct 


ctctttaatt 


ctacttaatc 


tctcattgct 


tacttatatt 


gtaagtgtga 


agctcaatgc 


ccaaggatcc 







<210> 89 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 89 

ttaaagcttt gggctcttgc aacagaggc 



PCT7IL2005/000627 



29 



29 



catagcattt aaatcgaatt 


60 


atgccaataa aaaggagaca 


120 


taacaaacat tcaaggagca 


180 


aaaagcaaat ttaattaagg 


240 


atttttcaac agatcagaac 


300 


gccacatgac ccatgttcca 


360 


atttgtgatg ccaatgccaa 


420 


gttttcaggc caggaggacc - 


480- 


attggtcatt gcacgtaaat 


540 


agaaaaataa tttactctct 


600 


gacaaataat acgatgataa 


660 


ttaatataaa atcctacccc 


720 


tgcatgcatg catgggatgc 


780 


ttcaccagat ctgatctaat 


840 


atccaagctc aagacatata 


900 


ttacctacca ttaccctttt 


960 


ccaagtaaac caagtacaga 


1020 




1040 




29 
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<210> 90 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 90 

aaggatccga cgacgacaac aacaacaac 29 

<210> 91 
<211> 792 
<212> DNA 

<213> Gossypium hirsutum 
<400> 91 

aagctttggg ctcttgcaac agaggcaatt ttctaccgga aatgaaggaa ggaatgaagg 60. 

aggcggtaga gagaagagca cagaaaatgt cttacagaaa ttgaaagggt aagacatttt 120 

ccttaaaatg taacacattt tctcttgttt tggagttcat tttccaaatg gaaaatgttt 180 

tccgccaatc aaaagctgaa aaagttaaaa atgattttct ggaaaatcaa ttccgtcaat 240 

caaacagacc cttagtctat tctctccaat taaatcattc ttagtcctta tactttttta 300 

aatttctatc tcgatacaaa agacaaccat tgaatctatt aaattacctt tgtgtaaatg 360 

atatatgaaa ataataaatt gatatgacat aacgcatgcg ataatatatg taaaaatcac 420 

caattacagg tacaaaaaaa tggttatgga ctaaatccgt aacttgcgca tgataaacga 480 

agtggcataa tggataattc agtgttttac aatgtcaaaa tagcagcacc gtaatcgaac 540 

atgatacctt ggtccagttg tgctgtttac cgttggtata gtatttctac tctctctcta 600 

taaagagaga acgggacaaa catcatcccc accgctatgc ctattccccc actcaaattc 660 

atttcacttt taaataccaa ttaatattac ttacacttac ttccccttta caaatagata 720 

atccaaaagc- agagcaaaaa - eagagataac cattcttttt cttttgttg-t tgttgttgtc- -780 

gtcgtcggat cc 792 

<210> 92 

<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

. <223> Single strand DNA oligonucleotide 

<400> 92 

aaatctagat aagttgataa agctaatttc tc 32 

<210> 93 

<211> 23 

<212> DNA 

' <213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

•<400> 93 

tttcccggga cctggaggca ate 23 

<210> 94 
<211> 1969 
<212> DNA ' 

<213> Nicotiana tabacum 
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<400> 94 

tctagataag ttgataaagc taatttctca ttttagctac catcgctagt aatcgtggca 60 

ataactaccc taactatagc atttattgct accaaataaa atttggcagc taatcataat 120 

tttttgtcat gaatcaatag ttattgtagc aatagttatc tcttagccac aataaattat 180 

ttaaaataaa atattatagc taaataaata tttttgcttt aagttctaaa agcttgtggc 240 

aatagttaaa tgatatagtc acagatttat tggtataatt gaattatgtt gctaatttct 300 

tagttttttg ccacgagtta aaaattacca atagctatag taacttttta atcacaataa 360 

aatatttgaa agaaaatatt gtagctaaat gaatattttt tccttcaagt tattaaaagt 420 

tgtggcaata taggttaaat tagccacatg tttcttgctt taatagaatt ttgtagctaa 480 

tcattaactt ttaccacgag ttgaacttaa tataacaaca ataacctttt aaccataata 540 

aagcgattta aatcaaatat tactaaataa ataactttgc tttcaagttt ctataaaatc 600 

atggcaatag tcattacgat aaaatgatat aaccacgaat atattgcaac gataaattct 660 

gtaactaatc attagttttt gcgacgaggt aaattttccg tcacagtagc aatcttctag 720 

gcacattaaa aatttgaaac aaaattttgt agtcaaataa atatttatct tcttatttta 780 

agaaaataaa aatagttaga taatagttac tactatttgt catgaaaata tcaatagata • 840 

caaatttaaa gtgactataa atttacgagt ttactatact ttagtcgtac agtttgcaat 900 

aatagtattt taaccacaat tagttatatg tacaaaataa cataagtgaa taactttttt 960 

tcaatgagaa aataagagtt gctcaaacaa tatcaagtta caaaaattta attttaactg 1020 

taaaagttat atttttccaa aataacataa actatagtaa ttatatatag tttgaagtat 1080 

taataaaatt taaatatgca aaagttaatt ttaataaacc atttgtatgc ctaacttgta 1140 

gcctctaaac tattttattt gctttattta tcaaactcat attttatttt attgcacctt 1200 

gttagttttg gacgttaatt atatatattt ggtgtaaaat ttaaaatata ttaacatttg 1260 

tggagaattt atgtatgcct .ggttcttaac tatt.t.tttt.t. tajtataactg gttagagtaa m 1320 . 

. tttcttatat ttcagtattt atttttaaat aagtcctcat aaattgaaga ctttaaaagt 1380 

ttttgtgtca ttcctctttt tatttaagaa attgaagaat tccgctaaat ttcatatttc 1440 

cgctgttatt taactgttta tttcccttgt taatataatt ggtaagaagt tttaaaataa 1500 

aggagttaat gattttctag gttcatggct tgcctagctt ctacgagtaa gcgccatcac 1560 

gactcccgag gataaggaaa tccgggtcgt agcattcact cacaaaaatt actaaaaaca 1620 

aagtttaccc ttctcccaaa agtaaatttc atatttggct ccacataatg tgttcaatga 1680 

gtcaagtgaa gtacttttca tgacaaaaaa aagttgctga aaaatgcata tctcatattt 1740 

tttttttaga gaaatcccat ttcttgccta aacgaaagcc tataaaagag catatattgc 1800 

aacaacagtt tgcagaaact atcaagtcaa ataatccccc ctttaattcc ctcccaaaat 1860 

gcagttcttc aacttctttt cccttttcct ttttgtgtca tttctctttt tatttaagaa 1920 

atggaagaat tccaatagcc aaaccaaaag attgcctcca ggtcccggg 1969 

<210> 95. 
<211> 198 
• <212> PRT 
<213> Gossypium hirsutum 

<400> 95 

Met Val Arg Gly Lys He Gin Met Lys Arg He Glu Asn Ala Thr Ser 
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10 15 



Arg Gin Val Thr Phe Ser Lys Arg Arg Asn Gly Leu Leu Lys Lys Ala 
20 25 30 

Tyr Glu Leu Tyr Val Leu Cys Asp Ala Glu Val Ala Val He lie Phe 
35 40 45 

Ser His Lys Gly Lys Leu Tyr Glu Phe Ser Ser Ser Asp Asn Met Gin 
50 55 60 

Asn Thr He Glu Arg Tyr Arg Gin Tyr Lys Lys Asp Val Gin Ser Asn 
65 70 75 80 

He Pro Glu Phe Asp Arg Tyr Thr Gin Gin Leu Arg Leu Glu Ala Glu 
85 90 95 



Asn Met Ala Lys Lys He Glu Phe Leu Glu Val Ser Lys Arg Arg Met 
100 105 HO 

Leu Gly Gin Asn Leu Gly Ser Cys Ser He Asp Glu Leu Gin Glu Val 
115 120 125 

Glu Asn Gin Leu Glu Arg Ser Leu Arg Asn He Arg Ala Arg Lys Gly 
130 135 140 

Tyr Leu Phe Lys Glu Gin He Leu Gin Leu Lys Ala Lys Glu Arg Tyr 
145 * 150 155 160 



Met Gin Glu Glu Asn Ala Lys Leu Ser Ala Lys Asn Asn Gly Thr Thr 
165 170 175 



Cys Ser Gin Gin Asn Ala Glu Val Glu Thr Glu Leu Phe Leu Gly Leu 

180 - - 185 190 - 



Pro Glu Asn Arg Cys Ser 
195 



<210> 96 
<211> 463 
<212> PRT 

<213> Gossypium hirsutum 
<400> 96 

Met Lys Arg Val Ser Asn He Tyr He Glu Ala Lys Lys His Leu Ser 
15 10 15 



Asp Leu Leu Thr Asn Glu Asp Gin Asn Glu Asp Leu Leu Ser Thr Gin 
20 25 30 



Val Pro Lys Thr Leu Gly Arg He Leu Ser Leu Pro Glu Tyr Asn Thr 
35 40 45 



Ser Pro Val Ser Ser Pro Gly Gin Asn Leu Glu His Ser Phe Thr Thr 
50 55 60 

Ala His Met Arg Phe Ala Gly Ser Asp Lys Leu Gin Met Val Ser Glu 
65 70 75 80 
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Asn Asp Arg Phe Val Ser Leu Leu Ser Met Arg Ala Glu Lys Thr Asp 
85 90 95 



Gly Gin Leu Cys lie Ser Glu Asn Lys Ser Asp Asn Glu Val Glu Ser 
100 105 110 



Asp Asn Ala He Ser Asn Asn Leu Asp Thr Ser Val Asn Asn Asp Lys 
115 120 125 



Glu Asp Pro He Phe Cys Ser He Lys Asp Glu Leu Ser Ser Lys Glu 
130 135 140 



Ser Val Ser He Val Lys Ala Thr Glu Met Met Val His Glu Glu Ser 
145 150 155 160 



Lys Ser Leu Asp He Ser Ser Glu Thr Ser Gly Ser Ser He He Thr 
165 170 175 



Asp Asp Lys Asn Val Asp He Tyr Glu Val Cys Asp Glu Lys Gin Asn 
180 ~ 185 190 



Pro Trp Tyr Leu Lys Gin Asp Ser Ser Glu Val Asp Gin Gin Pro Phe 
195 200 205 



Ser Pro Leu Ser Ser Pro Ser Asp Ser Ser Val Met Lys Lys Val Glu 
210 215 220 



Cys Leu Glu Ser Val Thr Asp He Pro Glu Arg Ser Ser Pro Val Ser 
225 230 235 240 



Val Leu Glu Pro He Phe Ala Asp Asp Leu He Ser Pro Ala Ser He 
245 250 255 



Arg Ser Tyr Ser Gly Glu Thr Ser He Gin Pro Leu Arg He Arg Phe 
260 265 270 



Glu Glu His Asp Ser Leu Ala Thr Asn Gin Ser Asn Arg He Lys Thr 
275 280 285 



Cys Met Asn Asp Lys Glu Ser He Phe Glu His He Lys Ala Val Leu 
290 295 300 



Gin Ala Ser Ser Phe Ser Trp Asp Glu Val Tyr He Arg Ser Leu Ser 
305 310 315 320 



Ser Asp Leu Leu He Asp Pro Leu Leu Val Asp Glu Val Glu Tyr Leu 
325 330 335 



Pro Asn Gin Leu Cys Gin Asp Gin Lys Leu Leu Phe Asp Cys He Asn 
340 345 350 



Glu Val Val Arg Glu Val Cys Glu Tyr Tyr Phe Gly Ser Pro Ser Val 
355 360 365 



Ser Phe Val Lys Pro Asn He Arg Pro lie Pro Asn Met Gin Asn Thr 
370 ~ 375 380 
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lie Gin Glu Val Trp Glu Gly Val Tyr Trp His Leu Leu Pro Thr Pro 
385 390 395 400 



Leu Pro Cys Thr Leu Asp Leu Val Val Arg Lys Asp Leu Ala Lys Thr 
405 410 415 



Gly Thr Trp Met Asp Leu Gin Leu Asp Thr Gly Tyr lie Gly Val Glu 
420 425 430 



lie Gly Glu Ala lie Phe Glu Asp Leu Val Glu Asp Thr lie Thr Ser 
435 440 445 



Tyr lie Asn Gly Ser Trp Glu Cys Glu Tyr Asn Val Leu Pro Ala 
450 "* 455 4 60 



<210> 97 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 97 

tcagattccg ccttgatagc a 



<210> 98 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 98 

ctctgtggtt gacctcactg tga 



<210> 99 - - --- - 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 99 

cctaccaaga ttcactcccg ate 



<210> 100 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



*<400> 


100 


cctcttacca tagecagaat gacg 


<210> 


101 


<211> 


21 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Single strand DNA oligonucleotide 


<400> 


101 
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tccacaacat tgacgtcagc c 



<210> 102 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 102 

ctaatagaag cccagcagtg g 



<210> 103 

<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 103 

aaatctagat aagttgataa agctaatttc tc 



<210> 104 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 104 

tttcccggga cctggaggca ate 



<210> 105 

<211> 1969 

<212> DNA 

<213> Gossypium hirsutum 



<400> 105 
tctagataag 


ttgataaagc 


taatttctca 


ttttagctac categctagt aatcgtggca 


60 


ataactaccc 


taactatagc 


atttattget 


accaaataaa atttggcagc taatcataat 


120 


tttttgtcat 


gaatcaatag 


ttattgtagc 


aatagttatc tcttagccac aataaattat 


180 


ttaaaataaa 


atattatagc 


taaataaata 


tttttgcttt aagttctaaa agcttgtggc 


240 


aatagttaaa 


tgatatagtc 


acagatttat 


tggtataatt gaattatgtt gctaatttct 


300 


tagttttttg 


ccacgagtta 


aaaattacca 


atagctatag taacttttta atcacaataa 


360 


aatatttgaa 


agaaaatatt 


gtagctaaat 


gaatattttt tccttcaagt tattaaaagt 


420 


tgtggcaata 


taggttaaat 


tagecacatg 


tttcttgett taatagaatt ttgtagctaa 


480 


tcattaactt 


ttaccacgag 


ttgaacttaa 


tataacaaca ataacctttt aaccataata 


540 


aagegattta 


aatcaaatat 


tactaaataa 


ataactttgc tttcaagttt ctataaaatc 


600 


atggcaatag 


tcattacgat 


aaaatgatat 


aaccacgaat atattgeaac gataaattct 


660 


gtaactaatc 


attagttttt 


gcgacgaggt 


aaattttccg tcacagtagc aatcttctag 


720 


gcacattaaa 


aatttgaaac 


aaaattttgt 


agtcaaataa atatttatct tcttatttta 


780 


agaaaataaa 


aatagttaga 


taatagttac 


tactatttgt catgaaaata tcaatagata 


840 


caaatttaaa 


gtgactataa 


atttacgagt 


ttactatact ttagtegtae agtttgcaat 


900 


aatagtattt 


taaccacaat 


tagttatatg 


tacaaaataa cataagtgaa taactttttt 


960 
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tcaatgagaa aataagagtt gctcaaacaa tatcaagtta caaaaattta attttaactg 1020 

taaaagttat atttttccaa aataacataa actatagtaa ttatatatag tttgaagtat 1080 

taataaaatt taaatatgca aaagttaatt ttaataaacc atttgtatgc ctaacttgta 1140 

gcctctaaac tattttattt gctttattta tcaaactcat attttatttt attgcacctt 1200 

gttagttttg gacgttaatt atatatattt ggtgtaaaat ttaaaatata ttaacatttg 1260 

tggagaattt atgtatgcct ggttcttaac tatttttttt tatataactg gttagagtaa 1320 

tttcttatat ttcagtattt atttttaaat aagtcctcat aaattgaaga ctttaaaagt 1380 

ttttgtgtca ttcctctttt tatttaagaa attgaagaat tccgctaaat ttcatatttc 1440 

cgctgttatt taactgttta tttcccttgt taatataatt ggtaagaagt tttaaaataa 1500 

aggagttaat gattttctag gttcatggct tgcctagctt ctacgagtaa gcgccatcac 1560 

gactcccgag gataaggaaa tccgggtcgt agcattcact cacaaaaatt actaaaaaca 1620 

aagtttaccc ttctcccaaa agtaaatttc atatttggct ccacataatg tgttcaatga 1680 

gtcaagtgaa gtacttttca tgacaaaaaa aagttgctga aaaatgcata tctcatattt 1740 

tttttttaga gaaatcccat ttcttgccta aacgaaagcc tataaaagag catatattgc 1800 

aacaacagtt tgcagaaact atcaagtcaa ataatccccc ctttaattcc ctcccaaaat 1860 

gcagttcttc aacttctttt cccttttcct ttttgtgtca tttctctttt tatttaagaa 1920 

atggaagaat tccaatagcc aaaccaaaag attgcctcca ggtcccggg 1969 

<210> 106 
<211> 356 
<212> PRT 

<213> Gossypium hirsutum 
<400> 106 

Met Asp Gly Tyr Cys Ser Arg Arg Val He Met Phe Leu Val Phe Ala 



Phe Ala Ala He Ser Arg Gly Tyr Gly Gin Glu Ser Thr Thr Leu Val 
20 25 30 

Pro Ala lie He Thr Phe Gly Asp Ser Val Val Asp Val Gly Asn Asn 
35 40 45 

Asp Tyr Leu Pro Thr He Phe Lys Ala Asn Tyr Pro Pro Tyr Gly Arg 
50 55 60 

Asp Phe Ala Asn Lys Lys Pro Thr Gly Arg Phe Cys Asn Gly Lys Leu 
65 70 75 80 

Ala Thr Asp He Thr Ala Glu Thr Leu Gly Phe Thr Thr Tyr Pro Pro 
85 90 95 

Ala Tyr Leu Ser Pro Glu Ala Ser Gly Lys Asn Leu Leu Leu Gly Ala 
100 105 110 

Asn Phe Ala Ser Ala Gly Ser Gly Tyr Asp Asp Lys Ala Ala Met Val 
115 120 125 

Asn His Ala He Thr Leu Thr Gin Gin Leu Glu Tyr Phe Lys Glu Tyr 

140 
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Gin Ala Lys Leu Ala Lys Val Ala Gly Ser Thr Lys Ser Ala Ser lie 
145 150 155 160 



Thr Lys Asp Ala Leu Tyr Val Leu Ser Ala Gly Ser Gly Asp Phe Leu 
165 170 175 



Gin Asn Tyr Tyr Val Asn Pro Leu Leu Asn His Ala Tyr Thr Pro Asp 
180 185 190 



Gin Tyr Gly Ser Phe Leu lie Asp Thr Phe Thr Asn Phe Val Lys Asn 
195 200 205 



Leu Tyr Gly Leu Gly Ala Arg Lys lie Gly Val Thr Ser Leu Pro Pro 
210 215 220 



Leu Gly Cys Val Pro Leu Ala Arg Thr Leu Phe Gly Tyr His Glu Lys 
225 230 235 240 



Gly Cys lie Ser Arg Phe Asn Thr Asp Ala Gin Gin Phe Asn Lys Lys 
245 250 255 



Leu Asn Ala Ala Ala Ala Asn Leu Gin Lys Gin His Pro Gly Leu Lys 
260 265 270 



lie Val Val Phe Asp lie Phe Lys Ala Leu Tyr Asp He Val Lys Ser 
275 280 285 



Pro Ser Asn Tyr Gly Phe Val Glu Ala Thr Lys Gly Cys Cys Gly Thr 
290 295 300 



Gly Thr Val Glu Thr Thr Ala Phe Leu Cys Asn Pro Lys Ala Pro Gly 
305 310 315 320 



Thr Cys Ser Asn Ala Ser Gin Tyr Val Phe Trp Asp Ser Val His Pro 
325 330 335 



Ser Gin Ala Ala Asn Gin Val Leu Ala Asp Ala Leu lie Val Gin Gly 
340 345 350 



He Ala Leu He 
355 



<210> 107 

<211> 645 

<212> PRT 

<213> Gossypium hirsutum 

<400> 107 

Met Glu Ala Ser Ser Ser Thr Ser His Asp Pro Ala Leu Phe His Ala 
1 5 10 15 " 



Pro Leu Leu Tyr His Pro Arg Arg Arg Ser Ser Arg Pro Leu Lys Gly 
20 25 30 



Phe Ala Val lie He Gly Ser Val Val Phe. Leu Leu Ser Leu Val Thr 
35 40 45 



Leu He Val Asn Gin Ser Pro Glu Pro Leu Ala Ser Asn Pro Ser Ser 
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50 55 60 



Val Thr Glu Ala Gly Ser Tyr Ser Met Ala Ala Gin Pro Arg Gly He 
65 70 75 80 

Ala Glu Gly Val Ser Ala Lys Ser Asn Pro Ser Leu Phe Asp Lys Val 

85 90 95 



Gly Phe Asn Trp Thr Asn Ala Met Phe Tyr Trp Gin Arg Thr Ala Tyr 
100 105 110 



His Phe Gin Pro Gin Lys Asn Trp Met Asn Asp Pro Asp Gly Pro Leu 
115 120 125 



Tyr His Lys Gly Trp Tyr His Leu Phe Tyr Gin Tyr Asn Pro Asp Ser 
130 135 140 



Ala He Trp Gly Asn He Thr Trp Gly His Ala Val Ser Thr Asp Leu 
145 150 155 160 



He His Trp Phe Tyr Leu Pro Leu Ala Met Val Pro Asp Gin Trp Tyr 
165 170 175 



Asp He Asn Gly Cys Trp Thr Gly Ser Ala Thr Leu Leu Pro Asp Gly 
180 185 190 



Arg He Val Met Leu Tyr Thr Gly Ser Thr Asn Asp Ser Val Gin Val 
195 200 205 



Gin Asn Leu Ala Tyr Pro Ala Asn Leu Ser Asp Pro Leu Leu Leu Gin 
210 215 220 



Trp Leu Lys Tyr Pro Gly Asn Pro Val Val Val Pro Pro Thr Gly He 

225 230 235 240.. 



Glu Asp Glu Glu Phe Arg Asp Pro Thr Thr Ala Trp Leu Gly Pro Asp 
245 250 255 



Gly Ser Trp Arg He Val Val Gly Thr Arg Phe Asn Thr Thr He Gly 
260 265 270 



Thr Ala Leu Val Phe Gin Thr Thr Asn Phe Ser Asp Tyr Glu Leu Leu 
275 280 285 



Asp Gly Val Leu His Ala Val Pro Gly Thr Gly Met Trp Glu Cys Val 
290 295 300 



Asp Phe Tyr Pro Val Ala He Asn Gly Ser Val Gly Leu Asp Thr Thr 
305 310 315 320 



Ala Leu Gly Pro Gly He Lys His Val Leu Lys Ala Ser Leu Asp Asp 
325 330 335 



Thr Lys Val Asp His Tyr Ala He Gly .Thr Tyr Asp Met He Thr Asp 
340 345 350 



Lys Trp Thr .Pro Asp Asn Pro Glu Glu Asp Val Gly He Gly Leu Lys 
355 360 . 365 
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Val Asp Tyr Gly Arg Tyr Tyr Ala Ser Lys Thr Phe Phe Asp Gin Ser 
370 ~ 375 380 

Lys Gin Arg Arg lie Leu Tyr Gly Trp Val Asn Glu Thr Asp Ser Glu 
385 390 395 400 

Ala Asp Asp Leu Glu Lys Gly Trp Ala Ser lie Gin Thr lie Pro Arg 

405 410 415 



Ser Val Leu Tyr Asp Asn Lys Thr Gly Thr His Leu Leu Gin Trp Pro 
420 425 430 



Val Glu Glu Val Glu Ser Leu Arg Leu Asn Ala Thr Val Phe Lys Asp 
435 440 445 

Val Val Val Glu Ala Gly Ser Val Val Pro Leu Asp He Gly Thr Ala 
450 455 460 

Thr Gin Leu Asp lie Leu Ala Glu Phe Glu He Glu Thr Leu Val Leu 
465 470 475 480 



Asn Ser Thr Glu Asp Glu Val Ser Asp Cys Gly Asp Gly Ala Val Asp 
485 490 495 



Arg Ser Thr Tyr Gly Pro Phe Gly Val Leu Val He Ala Asp Asp Ser 
500 505 510 



Leu Ser Glu Leu Thr Pro He Tyr Phe Arg Pro Leu Asn Thr Ser Asp 
515 520 525 



Gly Ser Leu Glu Thr Tyr Phe Cys Ala Asp Glu Thr Arg Ser Ser Lys 
530 535 540 



Ala Pro Asp Val Thr Lys Arg Val Tyr Gly Gly Lys He Pro Val Leu 
545 550 555 560 



Asp Asp Glu Asn Tyr Asn Met Arg Val Leu Val Asp His Ser Val Val 
565 570 575 



Glu Ser Phe Gly Gly Gly Gly Arg Thr Val He Thr Ser Arg Val Tyr 
580 585 590 



Pro Thr Glu Ala He Tyr Gly Ala Ala Arg Leu Phe Leu Phe Asn Asn 
595 600 605 



Ala Ser Gly Val Asn Val Lys Ala Thr Leu Lys He Trp Glu Met Asn 
610 615 620 



Ser Ala Phe He Arg Pro Phe Pro Phe Glu Glu Thr Leu Phe Gin Glu 
625 630 635 640 



Met Val Ala Ser Thr 
645 



<210> 108 
<211> 180 
<212> PRT 
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<213> Gossypium hirsutura 
<400> 108 

Met Glu Leu Ser lie Gin Lys lie Glu Ala Leu lie Arg Leu Ser Thr 
x 5 10 15 

He Val Met Leu Val Leu Thr Ala Cys Leu He Gly Leu Asp Ser Gin 
20 25 30 

Thr Lys Val He Phe Tyr Val Gin Lys Lys Ala Ser Phe Lys Asp Leu 
35 40 45 

Arg Ala Leu Val Gly Leu Leu Tyr He Thr Ser Leu Ala Ala Ala Tyr 
50 55 60 

Asn Leu Leu Gin Leu Cys Cys Ser Ser Phe Ser Ala Ser Tyr Lys Gly 
65 70 75 80 

Thr Ser Leu Gin Ser Tyr Ala Tyr Leu Ala Trp Leu Arg Tyr He Leu 
85 90 95 

Asp Gin Ala Val Val Tyr Ala Val Phe Ala Gly Asn Leu Ala Ala Leu 
100 105 110 

Glu His Ser Phe Leu Val Leu Thr Gly Glu Glu Asn Phe Gin Trp Leu 
115 120 125 

Lys Trp Cys Asn Lys Tyr Thr Arg Phe Cys Thr Gin He Gly Gly Ser 
130 135 140 

Leu Leu Cys Gly Phe Val Ala Ser Leu Leu Met Phe Ser He Ala Ser 
145 . 150 155 160 

lie Ser "Ala " Phe -Asn Leu Phe Arg Leu Tyr Ser Pro -Thr Lys Phe Met 
165 170 175 



His Leu Lys Leu 
180 



<210> 109 

<211> 189 

<212> PRT 

<213> Gossypium hirsutum 

<400> 109 

Met Ala Glu He Leu Arg Lys Pro Ser Val Leu Lys Lys Leu Leu Leu 
1 5 10 15 



Glu Leu Asp Gin Val Val Gly Lys Asp Arg. Phe Val Val Glu Ser Asp 
20 25 30 



He Pro Lys Leu Thr Tyr Leu Gin Ala Val Val Lys Glu Val Phe Arg 
35 40 45 



Leu His Pro Gly Val Pro Leu He He Pro Arg Arg Thr Asn Glu Ala 
50 55 60 

Cys Glu Val Ala Gly Tyr His He Pro Lys His Cys He Val Tyr Val 
65 70 75 80 
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Asn Val Trp Gly Met Ala Arg Asp Pro Asn Val Trp Glu Asp Pro Leu 
85 90 95 



Glu Phe Lys Pro Glu Arg Phe lie Gly Ser Ser Val Asp Val Lys Gly 
100 105 110 



Gin Asp Phe Asn Leu Leu Pro Phe Gly Thr Gly Arg Arg Ser Cys Val 
115 120 * 125 



Gly Trp Pro Leu Ala His Arg Met Val His Tyr Tyr Leu Ala Ala Leu 
130 135 140 



Leu His Ala Phe Gin Trp Glu Ser Pro Pro Asp Val Leu Asn Asp Leu 
145 150 155 160 



Gly Glu Arg Val Gly Leu Thr lie Gin Lys Gly Lys Ser Leu Leu Ser 
165 170 175 



Thr Pro Lys Pro Arg Leu Pro Ala Ser Val Tyr Glu Arg 
180 ** 185 



<210> 110 
<211> 468 
<212> PRT 

<213> Gossypium hirsutum 
<400> 110 

Met Ala Ser Leu Pro Phe lie Phe Phe Leu Ser Phe Phe lie lie Ser 
15 10 15 



Thr Thr Leu Thr Ser Ala Gly Ala Ala Ala Ala Thr lie Lys Leu Ser 
20 25 30 



Leu Ser Pro Phe Pro His Pro Ser Ser Ser His Pro Tyr Gin He Leu 
35 40 45 



Asn Asn Leu Val Thr Ser Ser Val Ala Arg Ala His His Leu Lys His 
50 55 60 



Pro Lys Ala Lys Ala Asp Asn Thr Thr Ser Ser Leu Leu Arg Ala Pro 
65 70 75 80 



Leu Phe Ser His Ser Tyr Gly Gly Tyr Thr He -Ser Leu Lys Phe Gly 
85 90 95 



Thr Pro Pro- Gin Thr Leu Pro Phe Val Met Asp Thr Gly Ser Ser Leu 
100 105 110 



Ser Trp Phe Pro Cys Thr Ser Arg Tyr Leu Cys Ser Gin Cys Ala Phe 
115 120 125 



Pro Asn Val Asp Pro Ala Lys He Pro Thr Phe Ala Pro Lys Leu Ser 
130 135 140 



Ser Ser Ser Lys Leu Val Gly Cys- Arg Asn Pro Lys Cys Ser Trp Leu 
145 150- ~ 155 ~ 160 
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Phe Gly Pro Asp Val Glu Ser Arg Cys Gin Asp Cys Glu Pro Thr Ser 
165 170 175 



Glu Asn Cys Thr Gin Thr Cys Pro Pro Tyr lie lie Gin Tyr Gly Leu 
180 185 190 



Gly Ser Thr Ala Gly Leu Leu Leu Val Glu Asn Leu Ala Phe Pro Gin 
195 200 205 



Lys Thr Phe Gin Asp Phe Leu Val Gly Cys Ser He Leu Ser Asn Arg 
210 215 220 



Gin Pro Ala Gly He Ala Gly Phe Gly Arg Ser Ala Glu Ser He Pro 
225 230 235 240 



Ser Gin Leu Gly Leu Lys Lys Phe Ser Tyr Cys Leu Val Ser Arg Arg 
245 250 255 



Phe Asp Asp Thr Gly Val Ser Ser Asn Met Leu Leu Glu Thr Gly Ser 
260 265 270 



Gly Ser Gly Asp Ala Lys Thr Pro Gly Leu Ser Tyr Thr Pro Phe Tyr 
275 280 285 

Arg Asn Gin Val Ala Ser Asn Pro Val Phe Lys Glu Phe Tyr Tyr Val 
290 295 300 



Thr Leu Arg Lys He Leu Val Gly Asp Lys His Val Lys Val Pro Tyr 
305 310 315 320 



Ser Tyr Leu Val Pro Gly Ser Asp Gly Asn Gly Gly Thr He Val Asp 
325 330 335 

•Ser Gly -Ser Thr- Phe Thr Phe- Met- Glu Arg Pro Val Phe- Glu -Val Val - 
340 345 350 



Ser Lys Glu Phe Glu Lys Gin Met Gly Asn Tyr Arg Arg Val Arg Glu 
355 360 365 



He Glu Asn Arg Ser Gly Leu Ala Pro Cys Phe Asn Thr Ser Gly Tyr 
370 375 380 



Thr Ser lie Glu He Pro Glu Leu Ser Phe Gin Phe Lys Gly Gly Ala 
385 390 395 400 



Lys Met Ala Leu Pro Leu Val Asn Tyr Phe Ser Phe Asp Gly Asp Asp 
405 410 415 



Lys Val Val Cys Leu Met He Val Ser Asn Asn Val Val Gly Gin Gly 
420 425 430 



Ser His Ser Gly. Pro Ala He He Leu Gly Ser Phe Gin Gin Gin Asn 
435 440 445 



Tyr Tyr He Glu Phe Asp He Ala Asn Asn Arg Phe Gly Trp Ala Glu 
450 455 460 



Arg Ser Cys Ala 
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465 



<210> 111 

<211> 451 

<212> PRT 

<213> Gossypium hirsutum 

<400> 111 



Met Ala Gly Val Glu Ala Gly Lys Glu Glu Glu Ala Thr Ala Val Arg 
1 5 10 15 



lie Thr Gly Lys Ser His Val Lys Pro Gly Lys Leu lie Gly Arg Lys 
20 25 30 



Glu Cys Gin Leu Val Thr Phe Asp Leu Pro Tyr Leu Ala Phe Tyr Tyr 
35 40 45 



Asn Gin Lys Leu Leu Phe Tyr Lys Asn Asp Gly Gly Gly Glu Phe Glu 
50 55 60 



Asp Lys Val Glu Lys Leu Lys Gly Gly Leu Arg Val Val Leu Glu Glu 
65 70 75 80 



Phe Tyr Gin Leu Gly Gly Lys Leu Gly Lys Asp Asp Asp Gly Val Leu 
85 90 95 



Arg Val Asp Tyr Asp Asp Asp Met Asp Gly Val Glu Val Val Glu Ala 
100 105 110 



Val Ala Glu Gly lie Thr Val Asp Glu Leu Thr Gly Asp Asp Gly Thr 
115 120 125 



Ser Ser Phe Lys Glu Leu lie Pro Phe Asn Gly Val Leu Asn Leu Glu 
130 135 140 



Gly Leu His Arg Pro Leu Leu Ser lie Gin Leu Thr Lys Leu Lys Asp 
145 150 155 160 



Gly Val Ala Met Gly Cys Ala Phe Asn His Ala He Leu Asp Gly Thr 
165 170 175 



Ser Thr Trp His Phe Met Ser Ser Trp Ala Gin He Cys Asn Gly Thr 
180 185 190 



Ser Ser Ser Val Val Val Pro Pro Phe Leu Asp Arg Thr Thr Ala Arg 
195 200 205 



Asn Thr Arg Val Lys .Leu Asp Leu Ser Pro Val Val Ser Cys Asn Gly 
210 215 220 



Asp Asp Ala Thr Lys Gin Gly Gin Pro Ala Pro Gin Met Arg Glu Lys 
225 230 235 240 



Leu Phe Arg Phe Ser Glu Ala Ala Val Asp Lys lie Lys Ser Arg Val 
245 250 255 



Asn Ser Thr Pro Pro Pro Ser Asp Gly Ser Lys Pro Phe Ser Thr Phe 
260 265 270 
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Gin Ser Leu Ala Val His lie Trp Arg His Val Ser Gin Ala Arg Asn 
275 280 285 



Leu Lys Pro Glu Asp Tyr Thr Val Phe Thr Val Phe Ala Asp Cys Arg 
290 295 300 



Lys Arg Val Asp Pro Pro Met Pro Asp Ser Tyr Phe Gly Asn Leu lie 
305 310 315 320 



Gin Ala lie Phe Thr Ala Thr Ala Ala Gly Leu Leu Leu Glu Asn Pro 
325 330 335 



Pro Ser Phe Gly Ala Ser Val lie Gin Lys Ala lie Glu Ser His Asp 
340 345 350 



Ala Lys Ala lie Asp Glu Arg Asn Lys Ala Trp Glu Ala Ala Pro Lys 
355 ' 360 365 



He Phe Gin Phe Lys Asp Ala Gly Val Asn Cys Val Ala Val Gly Ser 
370 375 380 



Ser Pro Arg Phe Lys Val Tyr Glu Val Asp Phe Gly Trp Gly Lys Pro 
385 390 395 400 



Val Gly Val Arg Ser Gly Ser Asn Asn Arg Phe Asp Gly Met Val Tyr 
405 410 415 



Leu Tyr Gin Gly Lys Ser Gly Gly Arg Ser He Asp Val Glu lie Thr 
420 ~ 425 430 



Met Glu Ala Gin Ala Met Glu Lys Leu Glu Lys Asp Lys Glu Phe Leu 
435 440 445 



Met Glu Val 
450 



<210> 112 
<211> 467 
<212> PRT 

<213> Gossypium hirsutum 
<400> 112 

Met Ser Thr Gin Ser Arg Ala Val Gly Gly Thr Glu His Asn Trp Cys 
15 10 15 



Arg Ala Val Val Gly Gly Thr Gly He Ala Val Leu Ala He lie Ser 
20 25 30 



Ser Lys Asn Pro Asp Val Ser His Leu Lys Asn Ala Leu His Lys Leu 
35 40 45 



Gin He Ser His Pro He Leu Arg Ser Arg Leu His Tyr Ser Pro Thr 
50 55 60 



Ala Asn Ser Tyr Ser Phe Val Thr Ser Pro Ser Pro Phe He Gin He 
65 70 75 80 



Lys Tyr Phe Asn His Ser Thr Thr Cys Gin He Leu Glu Asn Asn Gin 
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85 90 95 



Asn He Ser Pro Leu His Leu He Leu Glu His Glu Leu Asn Gin Asn 
100 105 110 



Ala Trp Val Ser Ser Ser Cys Thr Thr Lys His Asp Val Phe Phe Ala 
115 120 125 



Ser Val Tyr Ala Leu Pro Gly Ala Thr Arg Trp Val Leu Val Leu Arg 
130 135 140 



Leu His Ala Ala Ala Cys Asp Arg Thr Thr Ala Val Ser Leu Leu Arg 
145 150 155 160 



Glu Leu Leu Thr Leu Met Ala He Glu Glu Glu Glu Thr Gly Phe Gin 
165 170 175 



Gin Gly Gin Lys Glu lie Thr Met Asn Lys Gly Glu He Ser Leu Ala 
180 185 190 



Met Glu Asp He Leu Pro Lys Gly He val Lys Lys Thr Leu Trp Ala 
195 200 205 



Arg Gly Val Asp Met Leu Ser Tyr Ser Val Asn Ser Leu Arg Phe Thr 
210 215 220 



Asn Leu. Arg Phe Lys Asp Ala Lys Ser Pro Arg Ser Thr Gin Val Val 
225 230 235 240 



Arg Leu Leu He Asn Pro Asp Asp Thr Gin Lys He Leu Thr Gly Cys 
245 250 255 



Lys Ala Arg Gly He Lys Leu Cys Gly Ala Leu Gly Ala Ala Gly Leu 
260 265 270- - - - - 



He Ser Ala His Ser Ser Lys Ser Arg Ser Asp His Gin Lys Lys Lys 
275 280 285 



Tyr Gly Val Val Thr Leu Thr Asp Cys Arg Ser He Leu Glu Pro Pro 
290 295 300 



Leu Ser Asn His His Phe Gly Phe Tyr His Ser Ala He Leu Asn Thr 
305 310 315 320 



His Ala He Lys Gly Gly Glu Lys Leu Trp Glu Leu Ala Glu Lys Val 
325 330 335 



Tyr Thr Val Phe Thr His Tyr Lys Ser Cys Asn Lys His Leu Ser Asp 
340 345 350 



Met Ala Asp Leu Asn Phe Leu Met Cys Arg Ala Met Glu Asn Pro Gly 
355 360 ~ 365 



Leu Thr Pro Ser Ala Ser Leu Arg Thr Cys Leu He Ser Val Phe Glu 
370 375 380 



Asp Thr Val He Asp Glu Ser Ser Asn Gin Gin Asn Gin Val Gly Val 
385 390 395 400 
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Glu Asp Tyr Met Gly Cys Ala Ser Ala His Gly lie Ala Pro Ser lie 
405 410 415 

Ala He Phe Asp Thr He Arg Asp Gly Arg Leu Asp Cys lie Cys Val 

420 425 430 



Tyr Pro Ser Pro Leu His Ser Arg Glu Gin Met Gin Glu Leu Val Asp 
435 440 445 

Asn Met Lys Cys He Leu Val Asp Ala Gly Lys Asn Val Ala Asp Glu 
450 455 460 



Thr Glu Ser 
465 



<210> 113 
<211> 223 
<212> PRT 

<213> Gossypium hirsutum 
<400> 113 

Met Gly Arg Gly Lys He Glu He Lys Arg He Glu Asn Thr Thr Asn 
15 10 15 

Arg Gin Val Thr Phe Cys Lys Arg Arg Asn Gly Leu Leu Lys Lys Ala 
20 - 25 30 



Tyr Glu Leu Ser Val Leu Cys Asp Ala Glu Val Ala Leu He Val Phe 
35 40 45 



Ser Ser Arg Gly Arg Leu Tyr Glu Tyr Ser Asn Asn Asn He Arg Ser 
50 55 60 



Thr He Asp Arg Tyr Lys Lys Ala Cys Ser Asp Thr Ser Asn Thr Asn 
65 70 75 80 

Thr Val Thr Glu He Asn Ala Gin Tyr Tyr Gin Gin Glu Ser Ala Lys 
85 90 95 



Leu Arg Gin Gin He Gin Met Leu Gin Asn Ser Asn Arg His Leu Met 
100 105 HO 



Gly Asp Ser Leu Ser Ser Leu Thr Val Lys Glu Leu Lys Gin Val Glu 
115 120 125 



Asn Arg Leu Glu Arg Gly He Thr Arg He Arg Ser Lys Lys His Glu 
130 135 140 



Met Leu Leu Ala Glu He Glu Phe Leu Gin Lys Arg Glu He Glu Leu 
145 150 155 160 



Glu Asn Glu Ser Val Cys Leu Arg Thr Lys He Ala Glu He Glu Arg 
165 170 175 



Leu Gin Gin Ala Asn Met Val Thr Gly Pro Glu Leu Asn Ala He Gin 
180 185 190 



47 



WO 2005/121364 



PCT/IL2005/000627 



Ala Leu Ala Ser Arg Asn Phe Phe Ser Pro Asn Val He Glu His Pro 
195 200 205 



Ser Ala Tyr Ser His Leu Ser Asp Lys Lys He Leu His Leu Gly 
210 215 220 



<210> 114 
<211> 310 
<212> PRT 

<213> Gossypium hirsutum 
<400> 114 

Met Asp Val Thr Ser Thr Pro Asn Arg Lys Glu Met Asp Arg lie Lys 
15 10 15 



Gly Pro Trp Ser Pro Glu Glu Asp Asp Leu Leu Gin Gin Leu Val Gin 
20 * 25 30 



Lys His Gly Pro Arg Asn Trp Ser Leu He Ser Lys Ser He Pro Gly 
35 40 45 



Arg Ser Gly Lys Ser Cys Arg Leu Arg Trp Cys Asn Gin Leu Ser Pro 
50 55 60 



Gin Val Glu His Arg Ala Phe Thr Pro Glu Glu Asp Glu Thr He He 
65 70 75 80 



Arg Ala His Ala Arg Phe Gly Asn Lys Trp Ala Thr He Ala Arg Leu 
85 90 95 



Leu Asn Gly Arg Thr Asp Asn Ala He Lys Asn His Trp Asn Ser Thr 
100 105 110 



Leu Lys Arg Lys Cys Leu Pro Val Gly Glu Glu Cys Asn Phe Val Ala 
1-15 -- 120 125- 



Asn Gly Gly Tyr Asp Gly Asn Leu Gly Gly Glu Glu Arg Gin Pro Leu 
130 135 140 



Lys Arg Ser Val Ser Ala Gly Leu Tyr Met Ser Pro Gly Ser. Pro Ser 
145 150 155 160 



Gly Ser Asp Val Ser Asp Ser Ser Val Pro Val Leu Ser Ser Ser Tyr 
165 170 175 



Val Tyr Lys Pro He Pro Arg Thr Gly Gly Val Asn Val Asp Val Asn 
180 185 190 



Val Thr Pro Ala Gly Val Glu Ala Ala Ser Ser Ser Asn Asp Pro Pro 
195 200 205 



Thr Ser Leu Ser Leu Ser Leu Pro Gly Val Glu Ser Cys Glu Val Val 
210 215 220 



Ser Thr Gin Pro He Thr Glu Ser Thr Gin Asn Arg Ser Glu Glu Arg 
225 230 235 240 



Gly Gly Gly Val Met Gly Phe Ser Ala Glu Phe Met Ala Val Met Gin 
245 250 255 



48 



WO 2005/121364 



PCT/IL2005/000627 



Glu Met lie Arg Val Glu Val Arg Asn Tyr Met Thr Gin Met Gin Gin 
260 265 270 



Gin Gin Gin Gin Gin Asn Gly Ala Val Pro Gly Gly Ala Gly Met Gly 
275 280 285 



Met Cys Leu Asp Gly Gly Phe Arg Asn Leu Met Ala Val Asn Pro Val 
290 295 300 



Gly Met Ser Lys lie Glu 
305 310 



<210> 115 
<211> 593 
<212> PRT 

<213> Gossypium hirsutum 
<400> 115 

Met Gly Gly Pro Pro Tyr Asp Cys Leu Ala Asn Pro Leu Gly Ala Val 
15 10 15 

Arg Leu Thr Phe Glu Lys Ala He Trp Ser Glu Ser Glu Thr Pro Pro 
20 25 30 



He His Pro Ser Ala Phe Asn Gly Lys Asp Trp Gly Ala Leu Glu Leu 
35 40 45 



Phe Arg His Phe Leu Phe Gin Gly Ser Gly Leu Ser Gin Val Pro He 
50 55 60 



Leu Asn Pro Lys Thr Leu Arg Trp Val Gin Pro Asn Ser Leu Val Arg 
65 70 75 80 

Tyr- Arg Gly Met He Gin Asp Met Leu Gly Asn Glu Phe Tyr Ala Gly 
85 90 95 



Ala Tyr Lys Asp Gly Asn Leu Trp Arg Thr Asn Lys Phe Met Asp Val 
100 105 HO 



Ser Gin Tyr Pro Met Gly Ser Ser Pro Asp Met Cys He Trp Glu Arg 
115 120 125 



Arg Leu Leu Tyr Cys Val Pro Val Pro Gly Gin Asn Ser Trp Thr Glu 
130 ' 135 140 



Pro Ser Ser Glu Met Glu Pro Asn Trp Ser Ser Gin Thr Arg Glu Lys 
145 150 155 160 



Arg Arg Arg Met Asp Asp Glu Asp Asn Asp Pro Met Asp Leu Val Pro 
165 170 175 



Asp Asp Glu He Lys Ser Ser Pro He Thr Lys Lys Met Arg Glu Asp 
180 185 190 



Gly Leu Pro Ser Pro Ser Gin Ser Arg Asp Thr Lys Thr Thr Ser Ser 
195 200 205 
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Ser Ser lie Thr Ser Thr Phe Gin Ser Val Asp Glu Asp Asn Leu Pro 
210 215 220 



Cys Leu Val Lys lie Tyr Asp Ser Pro Glu Ser Glu Leu Lys Leu Asn 
225 230 235 240 

Asp Val Phe Glu Phe lie Gly Val Leu Thr Phe Asp Ser Glu Leu Ala 
245 250 255 



Val Glu Lys Asp Asp Asn Asp Glu Leu Ser Asn Ser Phe Tyr Asp Asp 
260 " 265 270 



Ala Leu Val His Leu Pro Pro Asn Lys Val Pro Arg Leu His Cys Leu 
275 280 285 



lie His Arg Lys Leu Ala Val Gin Asp Phe Leu Pro Gly Ser Pro He 
290 295 300 



He Glu Pro Lys Pro His Leu Val Lys Glu Thr Arg Glu Ala Leu Phe 
305 310 315 320 



Arg His Leu Thr Ala Val Leu Gly Asn Asp Glu Val Ala Ala His Phe 
325 330 335 



Val Leu Leu His Leu Leu Ser Lys Val His Ala Arg Val Asp Asp Val 
340 345 350 



Ala Val Gly Lys Leu Ser Leu Asn Leu Thr Gly Leu Asn Lys Glu Ser 
355 360 365 



Val Ser Val Phe Gly Thr Arg Leu Ser Asp Thr Phe Lys Asn Leu Leu 
370 375 " 380 



Pro. Phe Thr- Asn Cys. Met Pro Leu! Thr Leu Glu. Tyr . Leu. Asn . lie Ala. 
385 390 395 - 400 



Ser Leu Ala Pro Gin Lys Asp Tyr Gin Ala Asn Arg Leu Val Pro Gly 
405 410 415 



Val Leu Gin Leu Pro Glu Gly Ser His Leu Met Val. Asp Glu Thr Arg 
420 425 430 



Leu Glu Ser Gly Ser Leu Asn Ser Thr Gly lie Glu Asn Thr Lys Leu 
435 440 445 



Leu Lys Asn Leu lie Glu Phe Gin Lys Val Glu Tyr Asp Phe Gin Tyr 
450 455 460 



Tyr Lys Val Glu Met Ala Thr Asp Val Gin Leu Leu He Phe Ser Glu 
465 470 475 ' 480 



Gly Lys Ser Asn lie Val Pro Ala Asp Val He Val Pro Phe Gin Pro 
485 " 490 495 



Ser Cys Leu Glu Ser Thr Glu Met Pro Val Ala Glu Ala Leu Glu Ala 
500 505 510 



Trp Arg Trp Tyr Leu Ala Thr Val Arg Ser Leu Pro His Ser- He Gly 
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515 520 525 

Ser Glu lie Gin Lys Val Val Glu Asp Asp Leu Val Ala Ala Arg Gin 
530 535 540 

Met Asp Arg Ser Leu Gly Ser Arg Asp Phe Ser Arg Trp Leu Thr Met 
545 " 550 555 560 

Ala Arg Leu lie Ser Ser Ser Phe Gly Glu Thr Ser Leu Ser Lys Glu 
565 570 575 



His Trp Glu Met Ala Lys Glu Met Glu Arg Leu Arg Arg Glu Arg Leu 
580 585 590 



Lys 



<210> 116 

<211> 89 

<212> PRT 

<213> Gossypium hirsutum 



<220> 

<221> misc_feature 
<222> (48).. (48) 

<223> Xaa can be any naturally occurring amino acid 
<400> 116 

Met Ser Met Lys Lys Glu Gly Glu lie Leu Tyr Lys Lys Gly Leu Trp 
15 10 15 

Ala Met Glu Glu Asp Lys Leu Leu lie Asp Tyr Val Asn Val His Gly 
20 25 30 



Lys Gly- Gin- Trp Asn Lys -lie Ala Asn Arg . Thr.. Gly . Leu Lys Arg Xaa 
35 40 45 



Gly Lys Ser Cys Arg Leu Arg Trp Met Asn Tyr Leu Ser Pro Asn Val 
50 55 . 60 



Lys Lys Gly Asp Phe Ser Glu Glu Glu Glu Asp Leu Val lie Arg Leu 
65 70 75 80 



His Lys Leu Leu Glu Thr Gly Gly Leu 
85 



<210> 117 
<211> 628 
<212> PRT 

<213> Gossypium hirsutum 
<400> 117 

Met Ser Thr Gly Val Gin His Gin Glu Arg Val Pro Met Asn Leu Lys 
15 10 15 



Lys Gin Leu Ala Leu Ala Val Arg Asn He Gin Trp Ser Tyr Ala He 
20 25 30 



Phe Trp Ser He Ser Thr Arg Gin Pro Gly Val Leu Glu Trp Gly Glu 
35 40 45 
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Gly Tyr Tyr Asn Gly Asp lie Lys Thr Arg Lys Thr Val Gin Ser Val 
50 55 60 



Glu Leu Asn Thr Asp Gin Leu Ser Leu Gin Arg Ser Glu Gin Leu Arg 
65 70 75 80 



Gin Leu Tyr Glu Ser Leu Ser Ala Gly Glu Ser Ser Pro Gin Ala Lys 
85 90 95 



Arg Pro Ser Ala Ala Leu Ser Pro Glu Asp Leu Thr Asp Thr Glu Trp 
100 105 110 



Tyr Tyr Leu Val Cys Met Ser Phe Val Phe Asn lie Gly Gin Gly Leu 
115 120 125 



Pro Gly Arg Thr Leu Ser Thr Gly Gin Pro Val Trp Leu Cys Asn Ala 
130 135 140 



His Cys Ala Asp Ser Lys Val Phe Gly Arg Ser Leu Leu Ala Lys Ser 
145 150 155 160 



Ala Ser lie Gin Thr Ala Val Cys Phe Pro Phe Ser Gly Gly Val Val 
165 170 175 



Glu Leu Gly Val Thr Asp Leu Val Phe Glu Asp Leu Ser Leu He Gin 
180 185 190 



Arg Val Lys Thr Leu Leu Leu Asp Asp Pro Gin Pro He Val Ser Lys 
195 200 205 



Arg Ser He Gin Val Asp Gly Met Asn Asn Asp Leu Ala Cys Pro Ala 
210. 215 220 



Leu Asp Pro Leu lie Leu Ala Thr Lys Leu Ser Pro He Leu Gly Cys 
225 " 230 235 240 



Glu Gin Leu Glu Thr Val Ser Pro Asp Asp Ser Pro Asp Gly Leu Glu 
245 250 255 



Pro Lys Gin Ser Arg Glu Asp Ser leu Leu He Glu Gly He Asn Gly 
260 265 270 



Gly Ala Ser Gin Val Gin Ser Trp Gin Phe Met Asp Glu Glu Phe Cys 
275 280 285 



Asn Cys Val His His Ser Leu Asn Ser Ser Asp Cys He Ser Gin Thr 
290 - 295 300 



He Ala Asp His Arg Lys Val Val Pro Leu Tyr Arg Gly Glu Asn Asp 
305 310 315 320 



Asn Gly Leu Gin Asp Val Glu Glu Cys Asn Gin Thr Lys Leu Thr Ser 
325 330 335 



Phe Asp Arg Gin Asn Asp Asp Arg His Phe His Glu Val Leu Ser" Ala 
340 345 350 
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Leu Phe Lys Ser Ser His Pro Leu He Leu Gly Pro Gin Phe Arg Asn 
355 360 365 



Ser Asn Lys Glu Ser Ser Phe He Arg Trp Gin Lys Asn Gly Leu Val 
370 375 380 



Lys Pro Gin Lys Glu Arg Asp Glu Thr Pro Gin Lys Leu Leu Lys Lys 
385 390 395 400 



lie Leu Phe Leu Val Pro His Met His Asp Arg Gly Leu He Glu Ser 
405 410 415 



Pro Glu Thr Asn Ala Val Arg Asp Ala Ala Trp Arg Pro Glu Ala Asp 
420 425 430 



Glu He Cys Gly Asn His Val Leu Ser Glu Arg Lys Arg Arg Glu Lys 
435 440 445 



He Asn Glu Arg Leu Met Met Leu Lys Ser Leu Val Pro Ala Asn Asn 
450 455 460 



Lys Ala Asp Lys Val Ser He Leu Asp Val Thr He Glu Tyr Leu Gin 
465 470 475 480 



Thr Leu Glu Arg Arg Val Ala Glu Leu Glu Ser Cys Arg Lys Ser Glu 
485 490 495 



Ala Arg Thr Lys He Glu Arg Thr Ser Asp Asn Tyr Gly Asn Asn Lys 
500 505 510 



Thr Asn Asn Gly Lys Lys Ser Ser Leu Ser Lys Arg Lys Ala Tyr Asp 
515 * 520 525 



Val Val Asp Glu Ala Asp Gin Glu He Gly Tyr Val Ala Ser Lys Asp 
530 535 540 



Gly Ser Thr Asp Lys Val Thr Leu Ser Met Asn Asn Lys Glu Leu Leu 
545 550 555 560 



He Glu Phe Lys Cys Pro Trp Arg Glu Gly He Leu Leu Glu Val Met 
565 570 575 



Asp Ala Leu Ser He Leu Asn Leu Asp Cys His Ser Val Gin Ser Ser 
580 585 590 



Thr Thr Glu Gly He Leu Ser Leu Thr He Lys Ser Lys Tyr Lys Gly 
595 600 605 



Ser Ser Val Ala Lys Ala Gly Pro He Glu Gin Ala Leu Gin Arg He 
610 * 615 620 



Ala Ser Lys Cys 
625 



<210> 118 
<211> 123 
<212> PRT 

<213> Gossypium hirsutum 
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<400> 118 

Met Ala Ser Ser Gly Val Leu Lys Leu Val Ser Met He Leu Met Val 
1 5 10 15 

Cys Met Thr Met Met Ser Ala Pro Lys Ala Ala Lys Ala Ala He Thr 
20 25 30 



Cys Ser Asp Val Val Asn His Leu lie Pro Cys Leu Ser Tyr Val Gin 
35 40 45 



Asn Gly Gly Thr Pro Ala Ala Ala Cys Cys Ser Gly Val Lys Ala Leu 
50 55 60 



Tyr Gly Glu Val Gin Thr Ser Pro Asp Arg Gin Asn Val Cys Lys Cys 
65 1 70 75 80 



He Lys Ser Ala Val Asn Gly He Pro Tyr Thr Ser Asn Asn Leu Asn 
85 90 95 



Leu Ala Ala Gly Leu Pro Ala Lys Cys Gly Leu Gin Leu Pro Tyr Ser 
100 105 110 



lie Ser. Pro Ser Thr Asp Cys Asn Lys Val Gin 
115 120 



<210> 119 

<211> 362 

<212> PRT 

<213> Gossypium hirsutum 

<400> 119 

Met Ala Asn His Thr Val Thr Phe Leu Pro Lys Leu Ser He Glu Ala 
1 5 .10 15 . 



He Gin Thr Val Thr Pro Met Arg He Thr Glu Pro Arg Gin Thr Arg 
20 25 30 



Gin Val Leu Ala Gly Glu Leu Val Gly Pro Gly He Phe Gin Arg Cys 
35 40 45 



Leu Asn Val Val Gin Tyr Tyr Met Lys Glu Lys Glu Glu Asp Ser Gly 
50 55 60 



Trp Leu Leu Ala Gly Trp He Lys Glu Thr Leu Gly Arg Ala Leu His 
65 70 75 80 



Glu Gin Pro Met He Ser Gly Arg Leu Arg Lys Gly Glu Arg Asn Asp 
85 90 95 



Gly Glu Leu Glu He Val Ser Asn Asp Cys Gly He Arg Leu He Glu 
100 105 110 



Ala Arg He Gin Met Asn Leu Ser Asp Phe Leu Asp Leu Lys Gin Arg 
115 120 125 



Glu Asp Ala Glu Ala Gin Leu Val Phe Trp Lys Asp He Asp Glu Gin 
130 135 140 
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Asn Pro Gin Phe Ser Pro Leu Phe Tyr Val Gin Val Thr Asn Phe Gin 
145 150 155 160 



Cys Gly Gly Tyr Ser lie Gly lie Ser Cys Ser lie Leu Leu Ala Asp 
165 170 175 



Leu Leu Leu Met Lys Glu Phe Leu Lys Thr Trp Ala Asp lie His Asn 
180 185 190 



Lys Val He He Asn Lys Asn Asp Glu Gin Lys Leu Pro Leu Phe Tyr 
195 200 205 



Leu Pro Gly Leu Lys Asn Thr Asn Gly Ala Ser Pro Asn He He Thr 
210 215 220 



Ser Asn Ser Ser Lys Asn Ser Ala Lys Thr Met He Phe Gin He Gin 
225 230 235 240 



Ala Glu Thr Glu Ser Pro Gly Ser Asp Trp Cys Arg Lys Met Ala Leu 
245 250 255 



Ala Cys Leu Glu Glu Ala Glu Ser Asn Leu Gly Ser Val Val Gly Gly 
260 265 270 



Glu Phe Ser Leu Phe Val Asn Glu Ser Phe Glu Ser He Lys Val Glu 
275 280 • 285 



Ser Cys Ser Lys Gin Gly Met Ser Lys Glu Ala Glu Met Gly Val Leu 
290 295 300 



Asn Arg Ala Lys Trp Asp Asp Leu Gly Ala Asn Glu Val Ser Phe Gly 
305 310 315 320 



Asp Gly Asn Lys Pro Ala His Val Ser Tyr Trp Leu Arg Ser Thr Leu 
325 330 335 



Gly Gly Leu He He Val He Pro Ser Leu Gin Glu Asp Lys Tyr Thr 
340 345 350 



Val Asn He He Val Thr He Pro Ser Lys 
355 360 



<210> 120 

<211> 497 

<212> PRT 

<213> Gossypium hirsutum 

<400> 120 

Met Gly Phe Gin Arg Asn He Leu Gly Phe Leu Leu Leu He Leu Ala 
1 5 10 15 



Ser Leu Thr Ser Leu Ser Ser Ser Leu Pro Ser Glu Tyr Ser He Val 
20 25 30 



Glu His Glu He Asp Ala Phe Leu Ser Glu Glu Arg Val Leu Glu He 
35 40 45 



Phe Gin Gin Trp Lys Glu Lys Asn Gin Lys Val Tyr Arg Gin Ala Glu 
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50 55 60 

Glu Ala Glu Lys Arg Phe Glu Asn Phe Lys Gly Asn Leu Lys Tyr lie 
65 70 75 80 

Leu Glu Arg Asn Ala Lys Arg Lys Ala Asn Lys Trp Glu His His Val 
85 90 95 



Gly Leu Asn Lys Phe Ala Asp Met Ser Asn Glu Glu Phe Arg Lys Ala 
100 105 110 



Tyr Leu Ser Lys Val Lys Lys Pro lie Asn Lys Gly He Thr Leu Ser 
115 120 125 



Arg Asn Met Arg Arg Lys Val Gin Ser Cys Asp Ala Pro Ser Ser Leu 
130 135 140 

Asn Trp Arg Asn Tyr Gly Val Val Thr Ala Val Lys Asp Gin Gly Ser 
145 150 155 160 

Cys Gly Ser Cys Trp Ala Phe Ser Ser Thr Gly Ala Met Glu Gly He 
165 170 175 



Asn Ala Leu val Thr Gly Asp Leu He Ser Leu Ser Glu Gin Glu Leu 
180 185 190 



Val Asp Cys Asp Thr Ser Asn Tyr Gly Cys Glu Gly Gly Tyr Met Asp 
195 200 205 



Tyr Ala Phe Glu Trp Val He Asn Asn Gly Gly He Asp Ser Glu Thr 
210 215 220 



Asp Tyr Pro Tyr Thr Gly Val Asp Gly Thr Cys Asn Thr Thr Lys Glu 

-225 230 235 . ..240. 



Glu Thr Lys Val Val Ser He Asp Gly Tyr Gin Asp Val Glu Gin Ser 
245 250 255 



Asp Ser Ala Leu Leu Cys Ala Val Ala Gin Gin Pro Val Ser Val Gly 
260 265 270 



He Asp Gly Ser Ala He Asp Phe Gin Leu Tyr Thr Gly Gly He Tyr 
275 280 285 



Asp Gly Ser Cys Ser Asp Asp Pro Asp Asp He Asp His Ala Val Leu 
290 295 300 



He Val Gly Tyr Gly Ser Glu Gly Ser Glu Glu Tyr Trp He Val Lys 
305 310 315 320 



Asn Ser Trp Gly Thr Ser Trp Gly He Asp Gly Tyr Phe Tyr Leu Lys 
325 330 335 



Arg Asp Thr Asp Leu Pro Tyr Gly Val Cys Ala Val Asn Ala Met Ala 
340 345 350 



Set Tyr Pro Thr Lys Glu Ser Ser Ser Pro Ser Pro Tyr Pro Ser Pro 
355 . 360 365 
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Ser Val Pro Pro Pro Pro Pro Pro Ser Thr Pro Pro Pro Pro Pro Pro 
370 375 380 

Pro Ser Pro Ser Pro Ser Asp Cys Gly Asp Phe Ser Tyr Cys Ser Ser 
385 390 395 400 

Asp Glu Thr Cys Cys Cys Leu Phe Glu Phe Tyr Asp Tyr Cys Leu lie 
405 410 415 



Tyr Gly Cys Cys Glu Tyr Glu Asn Ala Val Cys Cys Thr Gly Thr Glu 
420 425 430 



Tyr Cys Cys Pro Ser Asp Tyr Pro lie Cys Asp Val Gin Glu Gly Leu 
435 ~~ 440 445 



Cys Leu Lys Asn Ala Gly Asp Tyr Leu Gly Val Ala Ala Arg Lys Arg 
450 455 460 

Lys Val Ala Lys His Lys Leu Pro Trp Thr Lys He Glu Glu Thr Glu 
465 470 475 480 

He Thr Tyr Gin Pro Leu Gin Trp Lys Arg Asn Pro Phe Ala Ala Met 
485 490 495 



Arg 



<210> 121 
<211> 335 
<212> PRT 

<213> Gossypium hirsutum 
<400> 121 

Met Lys Val Leu Ser Pro He Leu Ala Cys Leu Ala Leu Ala Val Val 
15 10 15 



Val Ser His Ala Ala Leu Ser Pro Glu Gin Tyr Trp Ser Tyr Lys Leu 
20 25 30 



"Pro Asn Thr Pro Met Pro Lys Ala Val Lys Glu lie Leu His Pro Glu 
35 40 45 



Leu Met Glu Glu Lys Ser Thr Ser Val Asn Val Gly Gly Gly Gly Val 
50 55 60 



Asn Val Asn Thr Gly Lys Gly Lys Pro Gly Gly Asp Thr His Val Asn 
65 70 75 80 



Val Gly Gly Lys Gly Val Gly Val Asn Thr Gly Lys Pro Gly Gly Gly 
85 90 95 



Thr His Val Asn Val Gly Asp Pro Phe Asn Tyr Leu Tyr Ala Ala Ser 
100 105 HO 



Glu Thr Gin He His Glu Asp Pro Asn Val Ala Leu Phe Phe Leu Glu 
115 120 125 
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Lys Asp Met His Pro Gly Ala Thr Met Ser Leu His Phe Thr Glu Asn 
130 135 140 



Thr Glu Lys Ser Ala Phe Leu Pro Tyr Gin Thr Ala Gin Lys lie Pro 
145 150 155 160 



Phe Ser Ser Asp Lys Leu Pro Glu lie Phe Asn Lys Phe Ser Val Lys 
165 170 175 



Pro Gly Ser Val Lys Ala Glu Met Met Lys Asn Thr lie Lys Glu Cys 
180 185 190 



Glu Gin Pro Ala lie Glu Gly Glu Glu Lys Tyr Cys Ala Thr Ser Leu 
195 200 205 



Glu Ser Met lie Asp Tyr Ser lie Ser Lys Leu Gly Lys Val Asp Gin 
210 215 220 



Ala Val Ser Thr Glu Val Glu Lys Gin Thr Pro Met Gin Lys Tyr Thr 
225 230 235 240 



lie Ala Ala Gly Val Gin Lys Met Thr Asp Asp Lys Ala Val Val Cys 
245 250 255 



His Lys Gin Asn Tyr Ala Tyr Ala Val Phe Tyr Cys His Lys Ser Glu 
260 265 270 



Thr Thr Arg Ala Tyr Met Val Pro Leu Glu Gly Ala Asp Gly Thr Lys 
275 280 285 



Ala Lys Ala Val Ala Val Cys His Thr Asp Thr Ser Ala Trp Asn Pro 
290 295 300 



Lys His Leu- Ala Phe -Gin Val -Leu Lys Val Glu Pro -Gly Thr He -Pro 
305 310 315 320 



Val Cys His Phe Leu Pro Arg Asp His He Val Trp Val Pro Lys 
325 330 335 



<210> 122 
<211> 302 
<212> PRT 

<213> Gossypium hirsutum 
<400> 122 

Met Glu- Arg Gin Arg Ser Lys Gin Val Cys Leu Leu Met Trp Val Leu 
15 10 15 



Val Ala Ala Phe Phe Ser His Asn Arg Val He Ala Val Thr Ser Thr 
20 25 30 



Gly Leu Gly Glu Gin Lys Asn Tyr Tyr Pro Ala Pro Asp Pro His Ala 
35 40 45 . 



Gly Thr Pro Pro Ser Gly Ser His Gly Thr Pro Pro Ser Ser Gly Gly 
50 55 60 



Gly Ser Pro Pro Ser His Gly Thr Pro Ser His Gly Gly Gly Tyr His 
65 70 75 80 



58 



WO 2005/121364 



PCT/IL2005/000627 



Pro Ser Pro Thr Pro Ser Thr Pro Ser Gly Gly Asn Cys Gly Thr Pro 
85 90 95 

Pro His Asp Pro Ser Thr Pro Ser Thr Pro Ser His Thr Pro Pro His 
100 105 HO 

Gly Thr Pro Pro Ser Ser Gly Gly Gly Ser Pro Pro Ser Tyr Gly Gly 
115 120 125 

Gly Ser Pro Pro Ser Tyr Gly Gly Gly Ser Pro Pro Ser Tyr Gly Gly 
130 135 140 

Gly Ser Pro Pro Ser Tyr Gly Gly Gly Ser Pro Pro Ser Tyr Gly Gly 
145 150 155 160 

Gly Ser Pro Pro Thr Thr Pro lie Asp Pro Gly Thr Pro Ser lie Pro 
165 170 175 

Ser Pro Pro Phe Phe Pro Ala Pro Thr Pro Pro lie Gly Gly Thr Cys 
180 185 190 

Asp Phe Trp Arg Ser His Pro Thr Leu He Trp Gly Leu Leu Gly Trp 
195 200 205 

Trp Gly Thr Val Gly Asn Ala Phe Gly Val Thr Asn Ala Pro Gly Leu 
210 215 220 

Gly Thr Ser Met Ser Leu Pro Gin Ala Leu Ser Asn Thr Arg Thr Asp 
225 230 235 240 

Gly Leu Gly Ala Leu Tyr Arg Glu Gly Thr Ala Ser Phe Leu Asn Ser 
245 ~ 250 255 

Met Val Asn Asn Arg Phe Pro Phe Ser Thr Lys Gin Val Arg Glu Thr 
260 265 270 

Phe Val Ala Ala Leu Gly Ser Asn Ser Ala Ala Ala Ala Gin Ala Arg 
275 280 285 



Leii Phe Lys Leu Ala Asn Glu Gly His Leu Lys Pro Arg Thr 
290 295 300 



<210> 123 
<211> 196 
<212> PRT 

<213> Gossypium hirsutum 
<400> 123 

Met Met Lys Arg Gly Phe He Val Leu Ala Leu Thr Val Val Phe Ala 
15 10 15 



Ala Thr Val Val Thr Ala Ala Asp Glu Ser Gly Leu Ala Asn Glu Cys 

20 25 30 

Ser Lys Asp Phe Gin Ser Val Met Thr Cys Leu Ser Phe Ala Gin Gly 
35 40 45 
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Lys Ala Ala Ser Pro Ser Lys Glu Cys Cys Asn Ser Val Ala Gly He 
50 55 60 

Lvs Glu Asn Lys Pro Lys Cys Leu Cys Tyr lie Leu Gin Gin Thr Gin 
6 5 70 75 80 

Thr Ser Gly Ala Gin Asn Leu Lys Ser Leu Gly Val Gin Glu Asp Lys 
85 90 95 

Leu Phe Gin Leu Pro Ser Ala Cys Gin Leu Lys Asn Ala Ser Val Ser 
100 105 HO 

i 

Asp Cys Pro Lys Leu Leu Gly Leu Ser Pro Ser Ser Pro Asp Ala Ala 
115 120 125 

He Phe Thr Asn Ser Ser Ser Lys Ala Thr Thr Pro Ser Thr Ser Thr 
130 135 140 

Thr Thr Ala Thr Pro Ser Ser Ala Ala Asp Lys Thr Asp Ser Lys Ser 
145 150 155 160 

Ser Gly He Lys Leu Gly Pro His Phe Val Gly Ser Thr Ala Ala Leu 
165 170 175 

Leu Val Ala Thr Ala Ala Val Phe Phe Leu Val Phe Pro Ala Gly Phe 
180 185 190 



Ala Ser He Val 
195 



<210> 124 

<211> 629 

<212> PRT 

<213> Gossypium hirsutum 

<400> 124 



Met Pro Val Val Asp Phe Ala Cys Val Phe Leu Val Ser Val Val Met 
15 10 15 

Phe Asn Leu Arg Val Ser Thr Glu Pro Val Glu Asp Lys Gin Ala Leu 
20 25 30 

Leu Ala Phe He Ser Gly He Arg His Ala Asp Arg Val Lys Trp Asn 
35 40 45 

Ser Ser Thr Ser Ala Cys Asp Trp Phe Gly Val Gin Cys Asp Ala Asn 
50 55 60 

Arg Ser Phe Val Tyr Thr Leu Arg Val Pro Gly Trp Gly Pro Tyr Gly 
65 70 75 80 

Val Arg Phe Arg Pro Lys Gin He Gly Arg Leu Asn Arg Leu Arg Val 
85 90 95 

Leu Ser Leu Arg Ala Asn Arg Leu Ser Gly Glu He Pro Ala Asp Phe 
100 105 HO 



Tyr Asn Leu Thr Gin Leu Arg Ser Leu Tyr Leu Gin Gly Asn Glu Phe 
115 120 125 
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Thr Gly Pro Phe Pro Pro Ser Val Thr Arg Leu Thr Arg Leu Thr Arg 
130 135 140 

Leu Asp Leu Ser Ser Asn Asn Phe Thr Gly Pro He Pro Leu Gly Val 
145 150 155 160 

Asn Asn Leu Thr Gin Leu Thr Lys Leu Phe Leu Gin Asn Asn Lys Phe 
165 170 175 

Ser Gly Ser Leu Pro Ser He Asp Ser Asp Gly Leu Asn Asp Phe Asn 
180 185 190 

Val Ser Asn Asn Asn Leu Lys Gly Ser He Pro Asp Ser Leu Ser Lys 
195 200 205 

Phe Pro Glu Ser Ser Phe Ala Gly Asn He Gly Leu Cys Gly Gly Pro 
210 215 220 

Leu Arg Pro Cys Asn Pro Phe Pro Pro Ser Pro Ser Pro Thr Glu Pro 
225 230 235 240 

He Pro Pro Lys Thr Ser Gly Gin Ser Ser Lys Ser Leu Pro Thr Gly 
245 250 255 

Ala He He Ala He Ala Val Gly Ser Ala He Val Ala Leu Leu Leu 
260 265 270 

Leu Leu Phe Leu He He Cys Phe Arg Lys Trp Lys Arg Lys Ser Pro 
275 280 285 

Arg Arg Gin Lys Ala He Pro Ser Thr Thr His Ala Leu Pro Val Glu 
290 295 300 



Glu Ala Gly Thr Ser Ser Ser Lys Asp Asp He Thr Gly Gly Ser Thr 
305 310 315 320 

Glu He Glu Arg Met Met Asn Asn Lys Leu Met Phe Phe Lys Gly Gly 
325 330 335 

Val Tyr Ser Phe Asp Leu Glu Asp Leu Met Arg Ala Ser Ala Glu Met 
340 " 345 350 

Leu Gly Lys Gly Ser Thr Gly Thr Ser Tyr Arg Val Val Leu Ala Val 
355 .360 365 

Gly Thr Thr Val Ala Val Lys Arg Leu Lys Asp Val Ala Val Ser Lys 
370 375 380 

Arg Glu Phe Val Met Lys Met Gly Met Leu Gly Lys He Met His Glu 
385 390 395 400 

Asn Val Val Pro Leu Arg Ala Phe Tyr Tyr Ser Asp Glu Glu Lys Leu 
405 410 415 

Leu Val Tyr Asp Tyr Met His Gly Gly Ser Leu Phe Ala Leu Leu His 
420 425 430 
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Gly Ser Arg Ser Ser Ala Arg Thr Pro Leu Glu Trp Asp Pro Arg Met 
435 440 445 

Lys He Ala Leu Gly Val Ala Arg Gly Leu Ala His Leu His Ser Ser 
450 455 460 

Gin Asn Met Val His Gly Asn He Lys Ser Ser Asn He Leu Leu Arg 
465 470 475 480 

Pro Asp His Glu Ala Cys He Ser Glu Phe Gly Leu Asn Ser Leu Phe 
485 490 495 

Asn Thr Asn Thr Pro Pro Ser Arg He Ala Gly Tyr Gin Ala Pro Glu 
500 505 510 

Val He Gin Thr His Lys Val Thr Val Lys Ser Asp Val Tyr Ser Phe 
515 520 525 

Gly Val Leu Leu Leu Glu Leu Leu Thr Gly Arg Ala Pro He Gin Pro 
530 535 540 

Ser He Thr Glu Glu Gly Phe Asp Leu Pro Arg Trp Val Gin Ser Val 
545 550 555 560 

Val Arg Glu Glu Trp Ala Ala Glu Val Phe Asp Ala Glu Leu Met Ala 
565 570 575 

Tyr His Asp He Glu Glu Glu Met Val Gin Ala Leu Gin Met Ala Met 
580 585 590 

Val Cys Val Ser Thr Val Pro Asp Gin Arg Pro Val Met Ser Glu Val 
595 600 605 



Val Arg Met He Gly Asp Met He Asp Arg Gly Gly Thr Asn Asp Gly 
610 615 620 



Thr Ala Ala Ala He 
625 



<210> 125 
<211> 545 
<212> PRT 

<213> Gossypium hirsutum 
<400> 125 

Met Ala Glu Met Ser Thr Leu Cys Thr Phe Leu Phe Ser Leu Leu Leu 
15 10 15 

Phe Ala Ser His Pro Leu He Leu Pro Thr Ala Ala Asp Gly Arg Trp 
20 25 30 

Gin Leu Leu Gin Lys Ser He Gly He Ser Ser Met His Met Gin Leu 
35 40 45 

Leu Lys Asn Asp Arg Val Val Met Tyr Asp Arg Thr Asp Phe Gly Pro 
50 55 60 

Ser Thr Leu Pro Leu Ala Ser Gly Lys Cys His Asn Asp Pro Thr Asn 
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65 70 75 80 

Thr Ala Val Gin Val Asp Cys Thr Ala His Ser Val Glu Tyr Asp Val 
85 90 95 

Leu Ser Asn Lys Phe Arg Ala Leu Thr Val Gin Ser Asn Val Trp Cys 
100 105 110 

Ser Ser Gly Gly Val Met Pro Asp Gly Lys Leu Val Gin Thr Gly Gly 
115 120 125 

Phe Ser Glu Gly Glu Leu Arg Val Arg Val Phe Ser Pro Cys Glu Ser 
130 135 140 

Cys Asp Trp His Glu Thr Pro Asn Gly Leu Ala Ala Lys Arg Trp Tyr 
145 150 155 160 

Ala Thr Asn His Val Leu Pro Asp Gly Arg Gin He Val Val Gly Gly 
165 170 175 

Arg Glu Gin Phe Asn Tyr Glu Phe Val Pro Lys Asn He Ala Ala Asp 
180 185 190 

Thr Phe Lys Leu His Phe Leu Ser Glu Thr Asn Glu Arg Gly Val Glu 
195 200 205 

Asn Asn Leu Tyr Pro Phe Val Phe Leu Asn Val Asp Gly Asn Leu Phe 
210 215 220 

He Phe Ala Asn Asn Arg Ala He Leu Leu Asp Tyr Val Asn Asn Lys 
225 230 235 240 

Val Val Lys Thr Tyr Pro Lys He Pro Gly Gly Glu Pro Arg Ser Tyr 
-245 - - 250 - ... . .-255 

Pro Ser Thr Gly Ser Ala Val Leu Leu Pro Leu Lys Asn Leu Thr Ala 
260 265 270 

Ala Thr He Gin Ala Glu Val Leu Val Cys Gly Gly Ala Pro Lys Gly 
275 280 285 

Ser Phe Val Gin Ala Leu Gin Gly Lys Phe Val Lys Ala Leu Asn Thr 
290 295 300 

Cys Ala Arg He Ser He Thr Asp Pro Lys Pro Lys Trp Val Leu Glu 
305 '310 315 320 

Thr Met Pro Leu Ala Arg Val Met Gly Asp Met Val Leu Leu Pro Asn 
325 330 335 

Gly Lys Val Leu Val He Asn Gly Ala Arg Ser Gly Ser Ala Gly Trp 
340 345 350 

Asp Leu Gly Arg Asp Pro Val Leu Asn Pro Val Leu Tyr Met Pro Asp 
355 360 365 

Asn Glu He Glu Ser Arg Phe Lys He Leu Asn Pro Thr Lys He Pro 
370 375 380 
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Ara Met Tyr His Ser Thr Ala Val Leu Leu Arg Asp Gly Arg Val Leu 
385 ^ 390 395 400 

val Glv Gly Ser Asn Pro His Ala Tyr Tyr Asn Phe Thr Gly Val Leu 
405 410 415 

Tv'r Pro Thr Glu Leu Ser Leu Glu Ala Phe Tyr Pro Gly Tyr Leu Asp 
420 425 430 

Ala Lys Phe Asn Asn Leu Arg Pro Thr He Val Ala Pro Lys Ser Met 
435 440 445 

Ser Gly He Arg Tyr Asn Lys Lys Leu Lys He Lys Val Val He Thr 
450 455 460 

Gly Glu Val Thr Leu Asn Leu Leu Ser Val Thr Met Val Ser Pro Ala 
465 470 475 480 

Phe Asn Thr His Ser Phe Ser Met Asn Gin Arg Leu Leu Val Leu Gly 
485 490 495 

Asn Asp Lys Val Met Ala Ser Gly Lys Ser Thr Tyr Glu He Glu Val 
500 505 510 

Met Thr Pro Gly Ser Gly Asn Leu Ala Pro Ala Gly Phe Tyr Leu Leu 
515 520 525 

Phe Val Val His Gin Asp He Pro Ser Gin Gly He Trp Val His Leu 
530 535 540 



Lys 
545 



<210> 126 
<211> 508 
<212> PRT 

<213> Gossypium hirsutum 
<400> 126 

-Met Gin He Leu Pro Phe Arg Gly Gly Ala Leu Val Cys Phe He Ala 
15 10 15 

Ser Leu Leu Phe Val Ala Ser Phe Cys Asn Ala Asp Ala Lys Thr Val 
20 25 30 



Glu Val Val Gly Ala Gly Glu Cys Ala Asp Cys Ala Glu Asn Asn Leu 
35 40 45 



Glu He Ser Gin Ala Phe Ser Gly Leu Arg Val Ser He Asp Cys Lys 
50 55 60 



Pro Glu Asn Gly Lys Asn Phe Lys Thr Arg Gly Ser Gly Glu Leu Asp 
65 70 75 80 

Lys Gin Gly Asn Phe Lys Val Phe Val Pro Glu Asp Leu Val Glu Asn 
85 90 95 
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Gly Glu Leu Lys Glu Glu Cys Tyr Ala Gin Leu His Ser Val Ser Ala 
100 105 110 

Ala Pro Cys Pro Ala His Asp Gly Leu Glu Ser Ala Lys Leu Val Leu 
115 120 125 



Lys Ser Arg Ser Asp Gly Lys His Gly Phe Gly Leu Lys Gly Lys Leu 
130 135 140 

Arg Phe Ser Pro Leu Thr Cys Ala Ser Ala Phe Phe Trp Pro His Phe 
145 150 155 160 

Lys Phe Pro Pro Leu Pro Lys Trp Asn His Pro Pro Leu Pro Lys Phe 
165 170 175 



Pro Leu Pro Pro Phe Lys Gly Phe His His His Tyr Pro lie lie Pro 
180 185 190 



Pro lie Tyr Lys Lys Pro Leu Pro Pro Pro Ser Pro Val Tyr Lys Pro 
195 200 205 



Pro Pro Val Pro Val Asn Pro Pro Val Pro lie Tyr Lys Pro Pro Pro 
210 215 220 



Val Pro Val Tyr Lys Pro Pro Pro Val Pro Val Lys Pro Leu Pro Pro 
225 " 230 235 240 



Pro Val Pro lie Tyr Lys Pro Pro Pro Val Glu Lys Pro His Pro Pro 
245 250 255 



Pro Val Pro Val Tyr Lys Pro Pro Pro Val Pro Val Tyr Lys Lys Pro 
260 ' 265 270 



■Cys -Pro Pro Pro Val Pro Val- Tyr- Lys Ser Pro- Pro -Val Pro -Val -Tyr. 
275 280 285 



Lys Lys Pro His Pro Pro Pro Val Pro Val Tyr Lys Lys Pro His Pro 
290 295 300 



Pro Pro Val Pro Val Tyr Lys Lys Pro Cys Pro Pro Pro Val Pro Val 
305 310 315 320 



Tyr Lys Ser Pro Pro Val Pro Glu Pro His Pro Pro Pro Val Pro Val 
.325 330 335 



Tyr Lys Lys Pro His Pro Pro Pro Val Pro Val Tyr Lys Lys Pro Cys 
340 345 350 



Pro Pro Pro Val Pro Val Tyr Lys Ser Pro Pro Val Pro Glu Pro His 
355 360 365 



Pro Pro Pro Val Pro Val His Lys Pro Pro Pro Val Pro Val Tyr Lys 
370 375 380 



Lys Arg Val Pro Pro Pro Val Pro He Tyr Lys Pro Pro Pro Val Pro 
385 390 3.95 400 



Val Tyr Asn Lys Pro Leu Pro Pro Pro Val Pro Val Tyr Thr Lys Pro 
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405 410 415 

Leu Pro Pro Pro Val Pro Thr Tyr Lys Pro Lys Pro Leu Pro Pro lie 
420 425 430 

Pro Tyr Lys Pro Leu Pro Pro Leu Pro Lys lie Pro Pro Phe Pro Lys 
435 440 445 

Lys Pro Cys Pro Pro Leu Pro Lys Leu Pro Pro Leu Pro Lys lie Pro 
450 455 460 

Pro Lvs Tyr Phe His His His Pro Pro Leu Pro Lys Leu Pro Pro Leu 
465 470 475 480 

Pro Lys Tie Pro Pro Lys Tyr Phe His His His Pro Lys Phe Gly Lys 
485 490 495 



Trp Pro Ser Leu Pro Pro Phe Ala Pro His His Pro 
500 505 
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